SEQUENCE LISTING 



<110> University of Iowa Research Foundation et al . 

5 

<120> Novel polypeptides and methods of their use 

<130> 875.006WO1 

10 <150> US 50/105,575 

<151> 1998-10-26 

<160> 65 

15 <170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 2265 
<212> DNA 
20 <213> Artificial Sequence 

<220> 

<22 3> A modified DNA sequence 



25 <400> 1 





gagggtagag 


aaatcgaatg 


ccccttgcat 


caaggtcggt 


ttgacgtttg 


cacaggcaaa 


60 




gccctgtgcg 


cacccgtgac 


acagaacatc 


aaaacatatc 


cagtcaagat 


tgagaacctg 


120 




cgcgtaatga 


ttgatttgag 


ctaagaattt 


taacaggagg 


caccccgggc 


cctagagcgt 


180 




aatcaccccc 


attccatctt 


ttttaggtga 


aaacatgaat 


tacaataata 


aaatcttggt 


240 


30 


aagtgaatct 


ggtctgagcc 


aaaagcacct 


gattcatggc 


gatgaagaac 


ttttccaaca 


300 




tgaactgaaa 


accatttttg 


cgcggaactg 


gctttttctc 


actcatgata 


gcctgattcc 


360 




tgcccccggc 


gactatgtta 


ccgcaaaaat 


ggggattgac 


gaggtcatcg 


tctcccggca 


420 




gaacgacggt 


tcgattcgtg 


cttttctgaa 


cgtttgccgg 


catcgtggca 


agacgctggt 


480 




gagcgtggaa 


gccggcaatg 


ccaaaggttt 


tgtttgcagc 


tatcacggct 


ggggcttcgg 


540 


35 


ctccaacggt 


gaactgcaga 


gcgttccatt 


tgaaaaagat 


ctgtacggcg 


agtcgctcaa 


600 




taaaaaatgt 


ctggggttga 


aagaagtcgc 


tcgcgtggag 


agcttccatg 


gcttcatcta 


660 




cggttgcttc 


gaccaggagg 


GCCCtCCtCt 


tatggactat 


ctgggtgacg 


ctgcttggta 


720 




cctggaacct 


atgttcaagc 


attccggcgg 


tttagaactg 


gtcggtcctc 


caggcaaggt 


780 




tgtgatcaag 


gccaactgga 


aggcacccgc 


ggaaaacttt 


gtgggagatg 


cataccacgt 


840 


40 gggttggacg 


cacgcgtctt 


cgcttcgctc 


gggggagtct 


atcttctcgt 


cgctcgctgg 


900 




caatgcggcg 


ctaccacctg 


aaggcgcagg 


cttgcaaatg 


acctccaaat 


acggcagcgg 


960 
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ca tgggtgtg 


ttgtgggacg 


gatattcagg 


tgtgcatagc 


gcagacttgg 


ttccggaatt 


1020 




gatggcattc 


ggaggcgcaa 


agcaggaaag 


gctgaacaaa 


gaaattggcg 


atgttcgcgc 


1080 




tcggatttat 


cgcagccacc 


tcaactgcac 


cgttttcccg 


aacaacagca 


tgctgacctg 


1140 




ctcgggtgtt 


ttcaaagtat 


ggaacccgat 


cgacgcaaac 


accaccgagg 


tctggaccta 


1200 


5 


cgccattgtc 


gaaaaagaca 


tgcctgagga 


tctcaagcgc 


cgcttggccg 


actctgttca 


1260 




gcgaacggtc 


999<^ctgctg 


gcttctggga 


aagcgacgac 


aatgacaata 


tggaaacagc 


1320 




ttcgcaaaac 


ggcaagaaat 


atcaatcaag 


agatagtgat 


ctgctttcaa 


accttggttt 


1380 




cggtgaggac 


gtatacggcg 


acgcggtcta 


tccaggcgtc 


gtcggcaaat 


cggcgatcgg 


1440 




cgagaccagt 


tatcgtggtt 


tctaccgggc 


ttaccaggca 


cacgtcagca gctccaactg 


1500 


10 ggctgagttc 


gagcatgcct 


ctagtacttg 


gcatactgaa 


cttacgaaga 


ctactgatcg 


1560 




ctaacagacg 


agtcgaccat 


gatgatcaat 


attcaagaag 


acaagctggt 


ttccgcccac 


1620 




gacgccgaag 


agattcttcg 


tttcttcaat 


tgccacgact 


ctgctttgca 


acaagaagcc 


1680 




actacgctgc 


tgacccagga 


agcgcatttg 


ttggacattc 


aggcttaccg 


tgcttggtta 


1740 




gagcactgcg 


tggggtcaga 


ggtgcaatat 


caggtcattt 


cacgcgaact 


gcgcgcagct 


1800 


15 


tcagagcgtc 


gttataagct 


caatgaagcc 


atgaacgttt 


acaacgaaaa 


ttttcagcaa 


1860 




ctgaaagttc 


gagttgagca 


tcaactggat 


ccgcaaaact 


ggggcaacag 


cccgaagctg 


1920 




cgctttactc 


gctttatcac 


caacgtccag 


gccgcaatgg 


acgtaaatga 


caaagagcta 


1980 




cttcacatcc 


gctccaacgt 


cattctgcac 


cgggcacgac 


gtggcaatca 


ggtcgatgtc 


2040 




ttctacgccg 


cccgggaaga 


taaatggaaa 


cgtggcgaag 


gtggagtacg aaaattggtc 


2100 


20 cagcgattcg 


tcgattaccc 


agagcgcata 


cttcagacgc 


acaatctgat 


ggtctttctg 


2160 




tgattcagtg 


accattttta 


caaatggtca 


ctgcaaccgc 


ggtcaccatt 


aatcaaaggg 


2220 




aatgtacgtg 


tatgggcaat 


caacaagtcg 


tttcgataac 


cggtg 




2265 



<210> 2 
25 <211> 449 

<212> PRT 

<213> Artificial Sequence 
<220> 

30 <223> A polypeptide encoded by SEQ ID N0:1 

<400> 2 

Met Asn Tyr Asn Asn Lys lie Leu Val Ser Glu Ser Gly Leu Ser Gin 
15 10 15 

35 Lys His Leu He His Gly Asp Glu Glu Leu Phe Gin His Glu Leu Lys 
20 25 30 

Thr He Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu He 

35 40 45 

Pro Ala Pro Gly Asp Tyr Val Thr Ala Lys Met Gly He Asp Glu Val 
40 50 55 60 



lie Vai Ser Arg Gin Asn Asp Gly Ser lie Arg Ala Phe Leu Asn Val 
65 *. 70 75 80 

Cys Arg His Arg Gly Lys Thr L.eu Val Ser Val Glu Ala Gly Asn Ala 
85 90 95 

5 Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 
100 105 110 

Glu Leu Gin Ser Val Pro Phe Glu Lys Asp Leu Tyr Gly Glu Ser Leu 

115 120 125 

Asn Lys Lys Cys Leu Gly Leu Lys Glu Val Ala Arg Vai Glu Ser Phe 
10 130 135 140 

His Gly Phe lie Tyr Gly Cys Phe Asp Gin Glu Ala Pro Pro Leu Met 
145 150 155 160 

Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro Met Phe Lys His 
165 170 175 

15 Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val lie Lys 
180 185 190 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His 

195 200 205 

Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Glu Ser lie Phe 
20 210 215 220 

Ser Ser Leu Ala Gly Asn Ala Ala Leu Pro Pro Glu Gly Ala Gly Leu 
225 230 235 240 

Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 
245 250 255 

25 Tyr Ser Gly Val His Ser Ala Asp Leu Val Pro Glu Leu Met Ala Phe 
260 265 270 , 

Gly Gly Ala Lys Gin Glu Arg Leu Asn Lys Glu He Gly Asp Val Arg 

275 280 285 

Ala Arg He Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn 
30 290 295 300 

Ser Met Leu Thr Cys Ser Gly Val Phe Lys Val Trp Asn Pro He Asp 
305 310 315 320 

Ala Asn Thr Thr Glu Val Trp Thr Tyr Ala He Val Glu Lys Asp Met 
325 330 335 

35 Pro Glu Asp Leu Lys Arg Arg Leu Ala Asp Ser Val Gin Arg Thr Val 
340 345 350 

Gly Pro Ala Gly Phe Trp Glu Ser Asp Asp Asn Asp Asn Met Glu Thr 

355 360 365 

Ala Ser Gin Asn Gly Lys Lys Tyr Gin Ser Arg Asp Ser Asp Leu Leu 
40 370 375 380 
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Ser Asn Leu Gly Phe Gly Glu Asp 
385 •. 390 

Gly Val Val Gly Lys Ser Ala lie 
405 

5 Tyr Arg Ala Tyr Gin Ala His Val 
420 

Glu His Ala Ser Ser Thr Trp His 
435 440 

Arg 

10 



Val Tyr Gly Asp Ala Val Tyr Pro 
395 400 
Gly Glu Thr Ser Tyr Arg Gly Phe 

410 415 
Ser Ser Ser Asn Trp Ala Glu Phe 
425 430 
Thr Glu Leu Thr Lys Thr Thr Asp 
445 



<210> 3 
<211> 9841 
<212> DNA 
15 <213> Artificial Sequence 

<220> 

<223> A modified DNA sequence 



20 <400> 3 



gaattcatca 


ggaagacatt 


caaatgaacg 


taaacaataa 


gggcagcgtc 


tgtatttgcg 


60 


gcagcgaaat 


gctccctaaa 


ttcctcattt 


accccatctg 


aggattgctt 


tatgacagta 


120 


aagtggattg 


aagcagtcgc 


tctttctgac 


atccttgaag 


gtgacgtcct 


cggcgtgact 


180 


gtcgagggca 


a^ggagctggc 


gctgtatgaa 


gttgaaggcg 


aaatctacgc 


taccgacaac 


240 


25ctgtgcacgc 


atggttccgc 


ccgcatgagt 


gatggttatc 


tcgagggtag 


agaaatcgaa 


300 


tgccccttgc 


atcaaggtcg 


gtttgacgtt 


tgcacaggca 


aagccctgtg 


cgcacccgtg 


360 


acacagaaca 


tcaaaacata 


tccagtcaag 


attgagaacc 


tgcgcgtaat 


gattgattfcg 


420 


agctaagaat 


tttaacagga 


ggcaccccgg 


gccctagagc 


gtaatcaccc 


ccattccatc 


480 


ttttttaggt 


gaaaacatga 


attacaataa 


taaaatcttg 


gtaagtgaat 


ctggtctgag 


540 


30ccaaaagcac 


ctgattcatg 


gcgatgaaga 


acttttccaa 


catgaactga 


aaaccatttt 


600 


tgcgcggaac 


tggctttttc 


tcactcatga 


tagcctgatt 


cctgcccccg 


gcgactatgt 


660 


taccgcaaaa 


atggggattg 


acgaggtcat 


cgtctcccgg 


cagaacgacg 


gttcgattcg 


720 


tgcttttctg 


aacgtttgcc 


ggcatcgtgg 


caagacgctg gtgagcgtgg 


aagccggcaa 


780 


tgccaaaggt 


tttgtttgca 


gctatcacgg 


ctggggcttc 


ggctccaacg 


gtgaactgca 


840 


35gagcgttcca 


tttgaaaaag 


atctgtacgg 


cgagtcgctc 


aataaaaaat 


gtctggggtt 


900 


gaaagaagtc 


gctcgcgtgg 


agagcttcca 


tggcttcatc 


tacggttgct 


tcgaccagga 


960 


ggcccctcct 


cttatggact 


atctgggtga 


cgctgcttgg 


tacctggaac 


ctatgttcaa 


1020 


gcattccggc 


ggtttagaac 


tggtcggtcc 


tccaggcaag gttgtgatca 


aggccaactg 


1080 


gaaggcaccc 


gcggaaaact 


ttgtgggaga 


tgcataccac 


gtgggttgga 


cgcacgcgtc 


1140 


40ttcgcttcgc 


tcgggggagt 


ctatcttctc 


gtcgctcgct ggcaatgcgg 


cgctaccacc 


1200 
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tgaaggcgca 


ggcttgcaaa 


tgacctccaa 


atacggcagc 


ggcatgggtg 


tgttgtggga 


1260 


cggatattca 


ggtgtgcata 


gcgcagactt 


ggttccggaa 


ttgatggcat 


tcggaggcgc 


1320 


aaagcaggaa 


aggctgaaca 


aagaaattgg 


cgatgttcgc 


gctcggattt 


atcgcagcca 


1380 


cctcaactgc 


accgttttcc 


cgaacaacag 


catgctgacc 


tgctcgggtg 


ttttcaaagt 


1440 


5atggaacccg 


atcgacgcaa 


acaccaccga 


ggtctggacc 


tacgccattg 


tcgaaaaaga 


1500 


catgcctgag 


gatctcaagc 


gccgcttggc 


cgactctgtt 


cagcgaacgg 


tcgggcctgc 


1560 


tggcttctgg 


gaaagcgacg 


acaatgacaa 


tatggaaaca 


gcttcgcaaa 


acggcaagaa 


1620 


atatcaatca 


agagatagtg 


atctgctttc 


aaaccttggt 


ttcggtgagg 


acgtatacgg 


1680 


cgacgcggtc 


tatccaggcg 


tcgtcggcaa 


atcggcgatc 


ggcgagacca 


gttatcgtgg 


1740 


lOtttctaccgg 


gcttaccagg 


cacacgtcag 


cagctccaac 


tgggctgagt 


tcgagcatgc 


1800 


ctctagtact 


tggcatactg 


aacttacgaa 


gactactgat 


cgctaacaga 


cgagtcgacc 


1860 


atgatgatca 


atattcaaga 


agacaagctg 


gtttccgccc 


acgacgccga 


agagattctt 


1920 


cgtttcttca 


attgccacga 


ctctgctttg 


caacaagaag 


ccactacgct 


gctgacccag 


1980 


gaagcgcatt 


tgttggacat 


tcaggcttac 


cgtgcttggt 


tagagcactg 


cgtggggtca 


2040 


25gaggtgcaat 


atcaggtcat 


ttcacgcgaa 


ctgcgcgcag 


cttcagagcg 


tcgttataag 


2100 


ctcaatgaag 


ccatgaacgt 


ttacaacgaa 


aattttcagc 


aactgaaagt 


tcgagttgag 


2160 


catcaactgg 


atccgcaaaa 


ctggggcaac 


agcccgaagc 


tgcgctttac 


tcgctttatc 


2220 


accaacgtcc 


aggccgcaat 


ggacgtaaat 


gacaaagagc 


tacttcacat 


cagctccaac 


2280 


gtcattctgc 


accgggcacg 


acgtggcaat 


caggtcgatg 


tcttctacgc 


cgcccgggaa 


2340 


20gafcaaatgga 


aacgtggcga 


aggtggagta 


cgaaaattgg 


tccagcgatt 


cgtcgattac 


2400 


ccagagcgca 


tacttcagac 


gcacaatctg 


atggtctttc 


tgtgattcag 


tgaccatttt 


2460 


tacaaatggt 


cactgcaacc 


gcggtcacca 


ttaatcaaag 


ggaatgtacg 


tgtatgggca 


2520 


atcaacaagt 


cgtttcgata 


accggtgcag 


gctcaggaat 


cggtctcgaa 


ctggttcggt 


2580 


cctttaagtc 


ggccggttat 


tacgtatccg 


ctctcgtacg 


aaacgaggag 


caagaggcgc 


2640 


25ttctttgcaa 


agagttcaag 


gacgcactcg 


agattgtagt 


gggcgatgtc 


cgggaccacg 


2700 


caacaaatga 


gaagctgata 


aagcaaacaa 


tcgatagatt 


cggtcatctt 


gattgtttta 


2760 


ttgcaaatgc 


cggtatctgg 


gattacatgc 


tgagcatcga 


agagccttgg 


gagaaaatat 


2820 


cgagcagttt 


tgacgaaata 


ttcgacatta 


atgtcaagag 


ctatttcagt 


ggcatcagtg 


2880 


ccgccctgcc 


ggaactgaaa 


aagactaacg 


gatcagtggt 


gatgaccgct 


tcggtgtcgt 


2940 


30cccatgcggt 


cggtggtggt 


ggttcttgct 


acatcgccag 


caagcatgcg 


gtgctcggta 


3000 


tggttaaggc 


tttggcctac 


gaattggccc 


ccgaagttcg 


cgtgaacgct 


gtttcgccgg 


3060 


ggggcaccgt 


gacgtctctg 


tgcggtcccg 


cgagcgccgg 


tttcgacaaa 


atgcacatga 


3120 


aagacatgcc 


cggcatcgac 


gatatgatca 


aaggtctcac 


gcctcttggg 


tttgcagcca 


3180 


agcccgaaga 


cgtggtggca 


CGCtatttgt 


tgctggcttc 


gcgaaagcaa 


ggaaaattca 


3240 


35tcaccggcac 


cgtgattagc 


attgatggcg 


gtatggcgct 


cggtcgcaag 


tgagcttgta 


3300 


gccgatcaga 


agttatagac 


acatttcagg 


tgacgcccca 


tgaagacaaa 


actgtttatc 


3360 


aataacgcct 


ggatcgattc 


tagtgaccag 


cagaccttcg 


agcgcataca 


ccccgtcagc 


3420 


agcgatgtgg 


tgactgagag 


cgcaaacgcc 


acagtgacgg 


acgcgataaa 


ggcggcgcaa 


3480 


gcggccgagg 


aggcgttcaa 


gacctggaag 


gccgttggac 


cttcagagcg 


tcgccgcctt 


3540 


40ctcctaaa9g 


tcgccgatgt 


catggaaagt 


aaaacaccca 


agttcatcga 


agtgatggcc 


3600 
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atggaggtgg gagcttccgc cctttgggcc ggabtcaacg tccatgcgtc tgccaatgtg 3660 

ttccgagagg ctgcctcgct ggctacccaa attcagggtg aaaccatccc aacggacaaa 372 0 

gccgaaacgc tctcaatgac actacgtcag ccggtcggcc cgatcctaag catcgttcca 3780 

tggaacggca ccgcagtgct tgcggcacga gccatcgctt atccgctggt ctgtggcaac 3 840 

Sactgtggtgt tcaaaggctc tgaatttagt cccgcgacgc atgccctgat cacccagtgc 3900 

gtgcaggaag ccgggctgcc cgctggcgtg ctcaattacc tcaactcttc gcctgaccgt 3 960 

tcgcccgaga tcgctgacgc actgatctct gccaaggaga tccgccgcat caacttcacg 402 0 

ggttccaccc gcgtgggcag cattatcgcg cagaaagccg cgcaacacct caagcgctgc 4080 

ctgctggagc tcggcggcaa gtccccgctt attgttctgg atgatgcaga catcgatgcg 4140 

lOgcggtcaagg cagcggtgtt cggtagcttc ctgttccaag gtcagatctg catgtccact 4200 

gagcgctfcga tcgttgatga gaagatagcc gacgaatttg tcgcaaaatt tgtcgaaaaa 4260 

actaagcgct tgagcgcagg cgacccgtgc gtaactggcg actgcatcat cggcccgatg 432 0 

gtctcgccaa attcgggtga gcggatcaat ggtttgttca aagacgcgat cgacaaaggg 43 80 

gcaaaagttg tttgcggcgg cttggcccaa ggtgcgctca tgccggccac gatcctggat 4440 

IScacgtcaaat ctgacatgcg gatttacgat gaggagacct ttggtcccat caccgtggta 4500 

atccgttgta aaggcgaagc agaggccgtc cgcattgcca acgacagcgt ctatggcctg 4560 

tcgtcgggcg tatttggccg cgacatcaac cgcgctctac gcgtgggtat gtccatcgaa 4620 

tatggttctg tacacatcaa cggttcgacc gtccagaacg aggcgcaggc tccttacgga 4680 

ggcaccaaga acaccggcta cgggcgcttc gacggccgtg ctgtaatcga cgagttcaca 4740 

lOgagatcaagt ggctgaccat cgaacctttc gagcagcaat atcccttctg ataagcacta 4800 

actcccagga atcaaactat gagtaagcaa gctgcagtta tcgagctcgg atacatgggt 4860 

atctcggtca aggaccctga tgcgtggaaa tcatttgcca cggatatgct aggtctgcaa 4920 

gttcttgatg agggtgagaa ggaccgtttc tatctgcgga tggattactg gcatcatcgg 4980 

atcgtagtcc atcacaacgg acaggacgac ttggagtacc taggctggcg tgtagccggc 5040 

25aagccggagt tcgaagctct gggtcaaaag cttattgatg ccggttacaa gatccgcatc 5100 

tgc^acaaag ttgaggctca ggagcgtatg gtgttgggtc tgatgaagac agaagatccg 5160 

g9C9g<^aacc cgaccgagat attctggggc ccccggatcg acatgagcaa cccgttccat 5220 

cccggtcgcc ccctgcacgg aaagtttgtg accggtgacc aaggcttggg ccattgcatc 5280 

gttcgccaaa ccgacgtcgc agaagctcat aagttttata gcctgctggg cttccgtggg 5340 

30gacgtcgaat accggattcc gttgcccaac ggcatgactg ccgaactgtc gttcatgcat 5400 

tgcaacgccc gtgatcactc cattgctttt ggtgccatgc ccgctgccaa acgactcaat 5460 

cacttgatgc ttgagtacac ccatatggaa gacttgggat acacgcacca acagtttgta 5520 

aagaacgaaa ttgacattgc cttgcagctt ggcattcacg ccaacgacaa ggcgttgacg 5580 

ttctatggtg caacgccttc gggctggctc attgagcccg gctggcgagg tgccacggcc 5640 

35atagatgaag cggagtatta cgtcggcgac atcttcggcc atggcgtgga ggccactgga 5700 

tatggcctgg atgtaaaact gagctaaaga tgcgcgctcg ttgggcgagg ctctagtcca 57 60 

gcatcttcat acgcaaccaa ccttgcaggg cgatgagatc aaaggacgtt aaagcgaagg 5820 

ggaagtggtt cgggccatgc gcataccgat ccatgacatt tgtttcatag tatataggta 5880 

gaCaggCgaa tcaagcgctt agtcaactag tg^acacatc t^ttccatga ggctatctac 5940 

40tatctattca aaacaagaat aataaatagg atgaaaataa taatgataaa aagaacgatt 6000 
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tgtcttgtgt atcctctatt ctgtttggca agccccacat gggccgaaga gtcgccttgg 6060 

acgtaccgta ttggta-tgac taatgtagct ttcgatgcta gcgcaaaagt atacttaaat 6120 

ggtcagcggg tgccaggagg aagcgctgat gcgagcgata acaacgcgct tacattcgac 6180 

ttcggctacg ccatcaacga ccagtggaat gtacgtgcga ttgtcggtat tccgcctaca 6240 

5actaaagtga cgggcgcagg cacacttcct ggtatccagc tggggaaaat aacttacgct 6300 

ccaacagtat taacgttgaa ctataacctc cccgctttgg gtcccgttcg ccctcacata 6360 

99t9cgggag tcaattacac gcggattttt gaaagtcggg acgctaatct aaaatcgttc 6420 

gatgccgacc acgcttggtc ccccgcgcta catgttggtg ccgatattga cgttaaccgt 6480 

gsttggttcg ttagcattga tatccggaag ttatacctga aaaccgacgc atcagggtac 6540 

lOttggggccac aggaggcfcaa agcacgggta actcttgacc cattactaac ttcgatcgcg 6600 

atcggacgcc aattctgatg attctgttta aagttcttta tctatctaac cgcaaagggt 6660 

gtttccatgt cgaataaaat tatgaaaacg tcgcgtctta ccgccgaaga tatcaacggc 6720 

gcctggacta taatgcccac accctcgacg cctgatgctt ctgattggcg cagcactgcc 6780 

accgtggact tagaagagac tgcccgcata gttgaagagc tgattgcagc tggtgtcaac 6840 

15ggtattctaa gtatgggtac ttttggtgag tgcgccacgt tgacctggga tgaaaaacgt 6900 

gattatgtct cgacgattgt cgagaccatt cgtggtcgcg tgccttattt ctgtggcacg 6960 

acagccttaa atacccgaga agtcatccgc cagacccgag agcttatcga tattggcgcc 7020 

aacggcacta tgctcggggt gccgatgtgg gtgaagatgg acctgcctac agcggttcag 7080 

ttctatcgtg atgttgcaga tgcggtacca gaggctgcca ttgcgattta cgccaacccc 7140 

20gaagcattca agttcgactt ccctcgccca ttctgggcag agatgtccaa aattccgcag 7200 

gtagtgactg cgaagtatct aggcatcgga atgcttgact tggacctgag actggcaccc 7260 

aacatccgct tccttcccca cgaagatgac tattacgcgg ccgcacgcat caatcccgag 7320 

cgcataaccg cgttctggtc aagcggggcc atgtgcggcc cggctaccgc catcatgttg 7380 

cgtgacgaag tggtgcgggc caagagcacc ggtgactggg ccaaggccaa agccatctcc 7440 

25gatgatatgc gtgcagccga ctcgacattg tttccgcgtg gcgacttttc ggagttctcg 7500 

aagtataata tcgggcttga aaaggcacgg atggatgcgg ctggttggct caaggctggg 7560 

ccctgccgtc cgccctacaa ccttgttcca gaagactacc tcgctggtgc acagaaatca 7620 

ggcaaggctt gggccgcgct gcacgctaaa tacagtaatg aattgaagta gttcacctcc 7680 

gcagacctga gtgacagggt ggcgcagacg ctgagggtgc aggaattaag tgagctaaag 7740 

30cacatttctt gcgccaggca ttgccagatc agcaaagttt gctgatctgg cagtttcaaa 7800 

aatttgggcg aaagctgata tcaggaatac gggataaagg cagtgcacca taacgacggg 7860 

gcgtgccatt cgtgatgaac gattttgcta ttgtgccgac ttctgttctt ggagtgtttg 7920 

attgtgattg tcgattttta tttcgatttt ttgagtccgt tctcttactt ggccaaccat 7980 

cgtttgtcaa agcttgcgca agactatggc ttttccattc gttattacgc aatcgatttg 8040 

35gcgcgagtta aaatagccat cggaaacgtt ggtccatcta atcgcgacct gatagtcaag 8100 

ctggactatt tgaaagtaga tttgcaacgg tgggccgagc tttacgaaat accgttggta 8160 

ttcccagcta actacaacag ccgacggatg aatactgggc tttattactc gggagccatg 8220 

gcacagactg gtgcctatgt gaatgtagta tttaatgcgg tttggggaga tggcatagct 8280 

ccagatttgg aaagcttgcc tgctctggta tctgaaaaac taggctggga tcgtagcgcc 8340 

40ttcgaggact ttatcagcag cgatgccgca acagagaggt atgacgagca gacacatgcc 8400 
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gcgatcgaac gcaaagtgtt cggtgtgcca 
ggaaacgacc gtctafttat gctcgagaac 
tagtcgctac ggagcgcttg tgccggctaa 
atttgctcga tacagcgctt tcaaaatcag 
5ctacttcagg catcctgtgc gacacaaagt 
ttggaatggt agctgactcg ctatgcgacc 
gtaacggtgg ggtgtgaagc tcctgcaaca 
gaggtgaagc acagatgctc ggagccgtac 
gcaacgtaag tgggggttgg ggcgcaatgg 

lOgtcgttcggt gccttcctcc catgcctccg 
cctggctgat ctcggttagt aggtcatgca 
acggaatggt cagctcgtaa tggccggtct 
agtagcgctc gatgttctcc gtggcccgct 
ccaccgccag gcgcagggtg acggtggcca 

15cgatattgcg cttttgacct cgcgccaggg 
cacggtggcg taggtagctg tacagggtgc 
cgctgacgct caggcggccc tcgcggtaca 
ccttggctgg caggcccttg ggacggcgac 
ggcccgcctg agtccgctcg cggatcagct 

20gcgatGttgg gcgtgggtgc tgtcaatggg 
tgcagccagc tagccgacca actcaaccag 
cttccagatc accacggcat cgcccgctcg 
ccgcgcgctt tttgcgccgc tggcgatgtc 
cagggcatcg acctggaggt cggcgttcta 

25c 



acgatgtttt tgggcgatga aatgtggtgg 8460 

gcagtgggag gtgcgcctgt aaatggagaa 8520 

atgccgatat aagtggttga cctgatcgtt 8580 

cggctactga agtcagataa aaatgcggga 8640 

tttacctgta attgtccacc tattccgagt 8700 

agcgatagcc taacaagaca tgcatcactg 8760 

atgtagcccc ttgatgtgtg tatttgctgc 8820 

cggcttgtgg cgctaggctg gcaagtatga 8880 

gaaccaaaaa ccaacgcaag ccttaccagc 8940 

cctcgataaa gcagctgcgc atatcggctt 9000 

aggtcttgtc cagcgcctcg tcgctccgat 9060 

ccgaccgctt catgccgtag ggctccaggc 9120 

tccgaccgcg catgaacttg ctgttgttca 9180 

cccgctcgac ggttgactct gccggtgacg 9240 

cgctcttctg gtacgtcccg atctcgacgc 9300 

tcttggagat gtgcaacttc tcgccgatgg 9360 

gggtctccgc cgccatggcg gtggcctcgg 9420 

cgatccggcc tcgagtccgt gccgccgaca 9480 

cgcgctcgaa ctgaacaggt tgaacaccag 9540 

gtcgttcagg ctctgcaagc cgactttgcg 9600 

gtgcttgagc gagcgaccga ggcgatccag 9660 

aacatgggct agcaacttgt ccaactccgg 9720 

ttgatagatg cgttcgcacc cggcctgttt 9780 

atcccgagtg ctcacccgcg tataaccgat 9840 

9841 



<210> 4 
30 <211> 2515 

<212> DNA 

<213> Artificial Sequence 
<220> 

35 <223> A modified DNA sequence 

<400> 4 

gaattcatca ggaagacatt caaatgaacg taaacaataa gggcagcgtc tgtatttgcg 60 
gcagcgaaat gctccctaaa ttcctcattt accccatctg aggattgctt tatgacagta 120 
40 aagtggattg aagcagtcgc tctttctgac atccttgaag gtgacgtcct cggcgtgact 180 
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gtcgagggca 


3-ggagctggc 


gctgtatgaa 


gttgaaggcg 


aaatctacgc 


taccgacaac 


240 


ctgtgcacgc 


atggtrtccgc 


ccgcatgagt 


gatggttatc 


tcgagggtag 


agaaatcgaa 


300 


tgccccttgc 


atcaaggtcg 


gtttgacgtt 


tgcacaggca 


aagccctgtg 


cgcacccgtg 


360 


acacagaaca 


tcaaaacata 


tccagtcaag 


attgagaacc 


tgcgcgtaat 


gattgatttg 


420 


5 agctaagaat 


tttaacagga 


ggcaccccgg 


gccctagagc 


gtaatcaccc 


ccattccatc 


480 


ttttttaggt gaaaacatga 


attacaataa 


taaaatcttg 


gtaagtgaat 


ctggtctgag 


540 


ccaaaagcac 


ctgatfccatg gcgatgaaga 


acttttccaa 


catgaactga 


aaaccatttt 


600 


tgcgcggaac tggctttttc tcactcatga 


tagcctgatt 


cctgcccccg 


gcgactatgt 


660 


taccgcaaaa 


atggggattg acgaggtcat 


cgtctcccgg 


cagaacgacg 


gttcgattcg 


720 


10 tgcttttctg 


aacgtttgcc ggcatcgtgg 


caagacgctg gtgagcgtgg 


aagccggcaa 


780 


tgccaaaggt 


tttgtttgca 


gctatcacgg 


ctggggcttc 


ggctccaacg 


gtgaactgca 


840 


gagcgttcca 


tttgaaaaag 


atctgtacgg 


cgagtcgctc 


aataaaaaat 


gtctggggtt 


900 


gaaagaagtc 


gctcgcgtgg 


agagcttcca 


tggcttcatc 


tacggttgct 


tcgaccagga 


960 


ggcccctcct 


cttatggact 


atctgggtga 


cgctgcttgg 


tacctggaac 


ctatgttcaa 


1020 


15 gcattccggc 


ggtttagaac 


tggtcggtcc 


tccaggcaag gttgtgatca 


aggccaactg 


1080 


gaaggcaccc 


gcggaaaact 


ttgtgggaga 


tgcataccac 


gtgggttgga 


cgcacgcgtc 


1140 


ttcgcttcgc 


tcgggggagt 


ctatcttctc 


gtcgctcgct ggcaatgcgg 


cgctaccacc 


1200 


tgaaggcgca 


ggcttgcaaa 


tgacctccaa 


atacggcagc ggcatgggtg 


tgttgtggga 


1260 


cggatattca ggtgtgcata 


gcgcagactt 


ggttccggaa 


ttgatggcat 


tcggaggcgc 


1320 


20 aaagcaggaa 


aggctgaaca 


aagaaattgg 


cgatgttcgc 


gctcggattt 


atcgcagcca 


1380 


cctcaactgc 


accgttttcc 


cgaacaacag 


catgctgacc 


tgctcgggtg 


ttttcaaagt 


1440 


atggaacccg 


atcgacgcaa 


acaccaccga 


ggtctggacc 


tacgccattg 


tcgaaaaaga 


1500 


catgcctgag gatctcaagc gccgcttggc 


cgactctgtt 


cagcgaacgg 


tcgggcctgc 


1560 


tggcttctgg gaaagcgacg acaatgacaa 


tatggaaaca 


gcttcgcaaa 


acggcaagaa 


1620 


25 atatcaatca 


agagatagtg atctgctttc 


aaaccttggt 


ttcggtgagg 


acgtatacgg 


1680 


cgacgcggtc 


tatccaggcg 


tcgtcggcaa 


atcggcgatc 


ggcgagacca 


gttatcgtgg 


1740 


tttctaccgg 


gcttaccagg 


cacacgtcag 


cagctccaac 


tgggctgagt 


tcgagcatgc 


1800 


ctctagtact 


tggcatactg 


aacttacgaa 


gactactgat 


cgctaacaga 


cgagtcgacc 


1860 


atgatgatca 


atattcaaga 


agacaagctg 


gtttccgccc 


acgacgccga 


agagattctt 


1920 


30 cgtttcttca 


attgccacga 


ctctgctttg 


caacaagaag 


ccactacgct 


gctgacccag 


1980 


gaagcgcatt 


tgttggacat 


tcaggcttac 


cgtgcttggt 


tagagcactg 


cgtggggtca 


2040 


gaggtgcaat 


atcaggtcat 


ttcacgcgaa 


ctgcgcgcag cttcagagcg 


tcgttataag 


2100 


ctcaatgaag ccatgaacgt ttacaacgaa 


aattttcagc aactgaaagt 


tcgagttgag 


2160 


catcaactgg 


atccgcaaaa 


ctggggcaac 


agcccgaagc 


tgcgctttac 


tcgctttatc 


2220 


35 accaacgtcc 


aggccgcaat 


ggacgtaaat 


gacaaagagc 


tacttcacat 


ccgctccaac 


2280 


gtcattctgc 


accgggcacg 


acgtggcaat 


caggtcgatg 


tcttctacgc 


cgcccgggaa 


2340 


gataaatgga aacgtggcga 


aggtggagta 


cgaaaattgg 


tccagcgatt 


cgtcgattac 


2400 


ccagagcgca 


tacttcagac gcacaatctg 


atggtctttc 


tgtgattcag 


tgaccatttt 


2460 


tacaaatggt 


cactgcaacc gcggtcacca 


ttaatcaaag ggaatgtacg 


tgtat 


2515 



40 
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<210> 5 

<211> 9706. 

<212> DNA 

<213> Artificial Sequence 

5 

<220> 

<223> A modified DKA sequence 



<400> 5 



lOgctgatttcg 


accaattafct 


ccacgccgtc 


tgcgccagca 


aggccgtaat 


cagttcgtaa 


60 


aggggagccc 


gcagtaacag 


cgaaccgacg 


ccggtagctg 


cgcgcgtgaa 


tcctgagatc 


120 


acaaattctt 


ggcgattatc 


gcatttcaat 


ttcacccaag 


cgccgttcct 


tgcgctgctg 


180 


tagcggcggc 


catcacgctt 


ctccacgatt 


ccctccagct 


gcatccggca 


agcgttggcc 


240 


agcagctgca 


ctgggtaagc 


ttccagggta 


tcgctgaagc 


gtacgcgctc 


caatgggcag 


300 


15tgctcgagga 


gcgcctcaag 


tacctggcgg 


ctcagttcaa 


cgccctctcc 


ccgcagatct 


360 


gcaccgtcga 


ggtagagcag 


gtcgaacgcc 


acgtacacaa 


atttgtcggt 


acaccgtgcg 


420 


gcgaacgctg 


actgcagggc 


ttgaaatgct 


ggtcgaccgt 


cgtcatactg 


gaatacgacc 


480 


tcaccgtcga 


gccaggctga 


atggacggct 


aggcgtccca 


gctatttggc 


caggagtggc 


540 


atatgatcga 


tctagtcaaa 


cccgtttttg 


gtgaaaagct 


gcacctggtc 


accatcgatc 


600 


20cgtgctaaca 


gacgatacca 


gtcgtacttg 


atctcgtatc 


gccatgctcc 


tgccgacgac 


660 


agtgtcagta 


gagtaaccag 


ttgtggcgcg 


atccaacacg 


ggtttggtgc 


cttgggcggt 


720 


ttcactatcg 


ccacctcgct 


ataggaacat 


tccttcaggg 


tggagtgcgt 


aattttctga 


780 


aaggggagcc 


aggttatgag 


tattcacatt 


ggtgataaac 


aacatcactt 


atgcgttatt 


840 


gacatataac 


gtcgtattca 


cgattattta 


ccatataagt 


cttataataa 


cgaagccata 


900 


25ttatggaact 


cctcatacaa 


ccgaaaaatc 


gcataattcc 


cttcagtgcc 


ggtgccaacc 


960 


ttctggaagt 


gcttcgcgag 


aacggtgtag 


ctatttccta 


cagttgcttg 


tctgggcgtt 


1020 


gcggaacctg 


tcgctgccgg 


gttatagatg 


gcagtgtcat 


tgattctggg 


gcggaaaatg 


1080 


ggcaatcaaa 


cctcaccgac 


aagcagtatg 


tgctcgcctg 


tcagtcagta 


ctcactggca 


1140 


attgcgctat 


cgaagtccca 


gaagccgacg 


aaattgtcac 


tcacccggcg 


cgaatcatca 


1200 


30agggcacagt 


ggtcgcagtc 


gagtcgccca 


ctcacgatat 


ccgtcgctta 


cgcgtacgcc 


1260 


tctccaagcc 


cttcgagttc 


tcacccggac 


agtacgcgac 


actgcagttc 


agccctgagc 


1320 


atgcgcgtcc 


gtattcaatg 


gcaggtttgc 


cagatgacca 


agaaatggag 


ttccacatac 


1380 


gcaaggtgcc 


gggtgggcgc 


gtcacggagt 


atgttttcga 


acacgtccgc 


gaaggtacaa 


1440 


gcatcaagtt 


gagcgggcct 


cttggtacgg 


cttatctacg 


tcagaagcac 


accggaccga 


1500 


35tgctgtgtgt 


aggtggcggg 


accggactcg 


caccggtgct 


gtcgattgtt 


cgcggcgcgc 


1560 


tgaagtcggg 


tatgacgaac 


cccatcctcc 


tttatttcgg 


ggtgcgcagt 


cagcaagacc 


1620 


tctacgacgc 


agagcgattg 


cacaaactcg 


ccgctgacca 


ccctcaactg 


accgtacaca 


1680 


C99tgattgc 


aacgggcccg 


attaatgagg 


gtcagcgagc 


cggcctaatt 


accgatgtga 


1740 


tcgaaaaaga 


catcctttcg 


ctggctgggt 


ggagggccta 


cctgtgcggc 


gcaccagcga 


1800 


40tggttgaagc 


gttgtgcacc 


gtcaccaagc 


atcttggaat 


atcacccgaa 


catatttatg 


1860 
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ccgatgcctt 


ctatcccggt 


gggatctgaa 


tagttcccgg 


ccatgcacct 


ctgtccatcg 


1920 


agaattcatc 


aggaagacat 


tcaaatgaac 


gtaaacaata 


agggcagcgt 


ctgtatttgc 


1980 


ggcagcgaaa 


tgctccctaa 


attcctcatt 


taccccatct 


gaggattgct 


ttatgacagt 


2040 


aaagtggatt 


gaagcagtcg 


ctctttctga 


catccttgaa 


ggtgacgtcc 


tcggcgtgac 


2100 


Stgtcgagggc 


aaggagctgg 


cgctgtatga 


agttgaaggc 


gaaatctacg 


ctaccgacaa 


2160 


cctgtgcacg 


catggttccg 


cccgcatgag 


tgatggttat 


ctcgagggta 


gagaaatcga 


2220 


atgccccttg 


catcaaggtc 


ggtttgacgt 


ttgcacaggc 


aaagccctgt 


gcgcacccgt 


2280 


gacacagaac 


atcaaaacat 


atccagtcaa 


gattgagaac 


ctgcgcgtaa 


tgattgattt 


2340 


gagctaagaa 


ttttaacagg 


aggcaccccg 


ggccctagag 


cgtaatcacc 


cccattccat 


2400 


lOcttttttagg 


tgaaaacatg 


aattacaata 


ataaaatctt 


ggtaagtgaa 


tctggtctga 


2460 


gccaaaagca 


cctgattcat 


ggcgatgaag 


aacttttcca 


acatgaactg 


aaaaccattt 


2520 


ttgcgcggaa 


ctggcttttt 


ctcactcatg 


atagcctgat 


tcctgccccc 


ggcgactatg 


2580 


ttaccgcaaa 


aatggggatt 


gacgaggtca 


tcgtctcccg 


gcagaacgac 


ggttcgattc 


2640 


gtgcttttct 


gaacgtttgc 


cggcatcgtg 


gcaagacgct 


ggtgagcgtg 


gaagccggca 


2700 


ISatgccaaagg 


ttttgtttgc 


agctatcacg 


gctggggctt 


cggctccaac 


ggtgaactgc 


2760 


agagcgttcc 


atttgaaaaa 


gatctgtacg 


gcgagtcgct 


caataaaaaa 


tgtctggggt 


2820 


tgaaagaagt 


cgctcgcgtg 


gagagcttcc 


atggcttcat 


ctacggttgc 


ttcgaccagg 


2880 


aggcccctcc 


tcttatggac 


tatctgggtg 


acgctgcttg 


gtacctggaa 


cctatgttca 


2940 


agcattccgg 


cggtttagaa 


ctggtcggtc 


ctccaggcaa 


ggttgtgatc 


aaggccaact 


3000 


20ggaaggcacc 


cgcggaaaac 


tttgtgggag 


atgcatacca 


cgtgggttgg 


acgcacgcgt 


3060 


cttcgcttcg 


ctcgggggag 


tctatcttct 


gctcgctcgc 


tggcaatgcg 


gcgctaccac 


3120 


ctgaaggcgc 


aggcttgcaa 


atgacctcca 


aatacggcag 


cggcatgggt 


gtgttgtggg 


3180 


acggatattc 


aggtgtgcat 


agcgcagact 


tggttccgga 


attgatggca 


ttcggaggcg 


3240 


caaagcagga 


aaggctgaac 


aaagaaattg 


gcgatgttcg 


cgctcggatt 


tatcgcagcc 


3300 


25acctcaactg 


caccgttttc 


ccgaacaaca 


gcatgctgac 


ctgctcgggt 


gttttcaaag 


3360 


tatggaaccc 


gatcgacgca 


aacaccaccg 


aggtctggac 


ctacgccatt 


gtcgaaaaag 


3420 


acatgcctga 


ggatctcaag 


cgccgcttgg 


ccgactctgt 


tcagcgaacg 


Qtcgggcctg 


3480 


ctggcttctg 


ggaaagcgac 


gacaatgaca 


atatggaaac 


agcttcgcaa 


aacggcaaga 


3540 


aatatcaatc 


aagagatagt 


gatctgcttt 


caaaccttgg 


tttcggtgag 


gacgtatacg 


3600 


30gcgacgcggt 


ctatccaggc 


gtcgtcggca 


aatcggcgat 


cggcgagacc 


agttatcgtg 


3660 


gtttctaccg 


ggcttaccag 


gcacacgtca 


gcagctccaa 


ctgggctgag 


ttcgagcatg 


3720 


cctctagtac 


ttggcatact 


gaacttacga 


agactactga 


tcgctaacag 


acgagtcgac 


3780 


catgatgatc 


aatattcaag 


aagacaagct 


ggtttccgcc 


cacgacgccg 


aagagattct 


3840 


tcgtttcttc 


aattgccacg 


actctgcttt 


gcaacaagaa 


gccactacgc 


tgctgaccca 


3900 


35ggaagcgcat 


ttgttggaca 


ttcaggctta 


ccgtgcttgg 


ttagagcact 


gcgtggggtc 


3960 


agaggtgcaa 


tatcaggtca 


tttcacgcga 


actgcgcgca 


gcttcagagc 


gtcgttataa 


4020 


gctcaatgaa 


gccatgaacg 


tttacaacga 


aaattttcag 


caactgaaag 


ttcgagttga 


4080 


gcatcaactg 


gatccgcaaa 


actggggcaa 


cagcccgaag 


ctgcgcttta 


ctcgctttat 


4140 


caccaacgtc 


caggccgcaa 


tggacgtaaa 


tgacaaagag 


ctacttcaca 


tccgctccaa 


4200 


40cgtcattctg 


caccgggcac 


gacgtggcaa 


tcaggtcgat 


gtcttctacg 


ccgcccggga 


42^0 
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agataaatgg aaacgtggcg aaggtggagt acgaaaattg gtccagcgat bcgtcgatta 43 20 

cccagagcgc atacttpaga cgcacaatct gatggtcttt ctgtgattca gtgaccattt 4380 

ttacaaatgg tcactgcaac cgcggtcacc attaatcaaa gggaatgtac gtgtatgggc 4440 

aatcaacaag tcgtttcgat aaccggtgca ggctcaggaa tcggtctcga actggttcgg 4500 

5tcctttaagt cggccggtta ttacgtatcc gctctcgtac gaaacgagga gcaagaggcg 4560 

cttctttgca aagagttcaa ggacgcactc gagattgtag fcgggcgatgt ccgggaccac 4620 

gcaacaaatg agaagctgat aaagcaaaca atcgatagat tcggtcatct tgattgtttt 4680 

attgcaaatg ccggtatctg ggattacatg ctgagcatcg aagagccttg ggagaaaata 4740 

tcgagcagtt ttgacgaaat attcgacatt aatgtcaaga gctatttcag tggcatcagt 4800 

lOgccgccctgc cggaactgaa aaagactaac ggatcagtgg tgatgaccgc ttcggtgtcg 4860 

tcccatgcgg tcggtggtgg tggttcttgc tacatcgcca gcaagcatgc ggtgctcggt 4920 

atggttaagg ctttggccta cgaattggcc cccgaagttc gcgtgaacgc tgtttcgccg 4980 

gggggcaccg tgacgtctct gtgcggttcc gcgagcgccg gtttcgacaa aatgcacatg 5040 

aaagacatgc ccggcatcga cgatatgatc aaaggtctca cgcctcttgg gtttgcagcc 5100 

ISaagcccgaag acgtggtggc accctatttg ttgctggctt cgcgaaagca aggaaaattc 5160 

atcaccggca ccgtgattag cattgatggc ggtatggcgc tcggtcgcaa gtgagcttgt 5220 

agccgatcag aagttataga cacatttcag gtgacgcccc atgaagacaa aactgtttat 52 80 

caataacgcc tggatcgatt ctagtgacca gcagaccttc gagcgcatac accccgtcag 5340 

cagcgatgtg gtgactgaga gcgcaaacgc cacagtgacg gacgcgataa aggcggcgca 5400 

20agcggccgag gaggcgttca agacctggaa ggccgttgga ccttcagagc gtcgccgcct 5460 

tctcctaaag gtcgccgatg tcatggaaag taaaacaccc aagttcatcg aagtgatggc 5520 

catggaggtg ggagcttccg ccctttgggc cggattcaac gtccatgcgt ctgccaatgt 5580 

gttccgagag gctgcctcgc tggctaccca aattcagggt gaaaccatcc caacggacaa 5640 

agccgaaacg ctctcaatga cactacgtca gccggtcggc ccgatcctaa gcatcgttcc 5700 

25atggaacggc accgcagtgc ttgcggcacg agccatcgct tatccgctgg tctgtggcaa 5760 

cactgtggtg ttcaaaggct ctgaatttag tcccgcgacg catgccctga tcacccagtg 5820 

cgtgcaggaa gccgggctgc ccgctggcgt gctcaattac ctcaactctt cgcctgaccg 5880 

ttcgcccgag atcgctgacg cactgatctc tgccaaggag atccgccgca tcaacttcac 5940 

gggttccacc cgcgtgggca gcattatcgc gcagaaagcc gcgcaacacc tcaagcgctg 6000 

30cctgctggag ctcggcggca agtccccgct tattgttctg gatgatgcag acatcgatgc 6060 

ggcggtcaag gcagcggtgt tcggtagctt cctgttccaa ggtcagatct gcatgtccac 6120 

tgagcgcttg atcgttgatg agaagatagc cgacgaattt gtcgcaaaat ttgtcgaaaa 6180 

aactaagcgc ttgagcgcag gcgacccgtg cgtaactggc gactgcatca tcggcccgat 6240 

ggtctcgcca aattcgggtg agcggatcaa tggtttgttc aaagacgcga tcgacaaagg 6300 

35ggcaaaagtt gtttgcggcg gcttggccca aggtgcgctc atgccggcca cgatcctgga 6360 

tcacgtcaaa tctgacatgc ggatttacga tgaggagacc tttggtccca tcaccgtggt 6420 

aatccgttgt aaaggcgaag cagaggccgt ccgcattgcc aacgacagcg tctatggcct 6480 

gtcgtcgggc gtatttggcc gcgacatcaa ccgcgctcta cgcgtgggta tgtccatcga 6540 

atatggttct gtacacatca acggttcgac cgtccagaac gaggcc^cagg ctccttacgg 6600 

40aggcaccaag aacaccggct acgggcgctt cgacggccgt gctgtaatcg acgagttcac 6660 
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tggctgacca 


tcgaaccttt 


cgagcagcaa 


tatcccttct 


gataagcact 


6720 


aactcccagg 


aatcaa*^cta 


tgagtaagca 


agctgcagtt 


atcgagctcg 


gatacatggg 


6780 


tatctcggtc 


aaggaccctg 


atgcgtggaa 


atcatttgcc 


acggatatgc 


taggtctgca 


6840 


agttcttgat 


gagggtgaga 


aggaccgttt 


ctatctgcgg 


atggattact 


ggcatcatcg 


6900 


Sgatcgtagtc 


catcacaacg 


gacaggacga 


cttggagtac 


ctaggctggc 


gtgtagccgg 


6960 


caagccggag 


ttcgaagctc 


tgggtcaaaa 


gcttattgat 


gccggttaca 


agatccgcat 


7020 


ctgcgacaaa 


gttgaggctc 


^gg^gcgtat 


ggtgttgggt 


ctgatgaaga 


cagaagatcc 


7080 


gggcggcaac 


ccgaccgaga 


tattctgggg 


cccccggatc 


gacatgagca 


acccgttcca 


7140 


tcccggtcgc 


cccctgcacg 


gaaagtttgt 


gaccggtgac 


caaggcttgg 


gccattgcat 


7200 


lOcgttcgccaa accgacgtcg 


cagaagctca 


taagttttat 


agcctgctgg 


gcttccgtgg 


7260 


ggacgtcgaa 


taccggattc 


cgttgcccaa 


cggcatgact 


gccgaactgt 


cgttcatgca 


7320 


ttgcaacgcc 


cgtgatcact 


ccattgcttt 


tggtgccatg 


cccgctgcca 


aacgactcaa 


7380 


tcacttgatg 


cttgagtaca 


cccatatgga 


agacttggga 


tacacgcacc 


aacagtttgt 


7440 


aaagaacgaa 


attgacattg 


ccttgcagct 


tggcattcac 


gccaacgaca 


aggcgttgac 


7500 


15gttctatggt gcaacgcctt 


cgggctggct 


cattgagccc 


ggctggcgag 


gtgccacggc 


7560 


catagatgaa gcggagtatt 


acgtcggcga 


catcttcggc 


catggcgtgg 


aggccactgg 


7620 


atatggcctg 


gatgtaaaac 


tgagctaaag 


atgcgcgctc 


gttgggcgag 


gctctagtcc 


7680 


agcatcttca 


tacgcaacca 


accttgcagg 


gcgatgagat 


caaaggacgt 


taaagcgaag 


7740 


gggaagtggt 


tcgggccatg 


cgcataccga 


tccatgacat 


ttgtttcata 


gtatataggt 


7800 


20agataggtga 


atcaagcgct 


tagtcaacta 


gtggacacat 


ctgttccatg 


aggctatcta 


7860 


ctatctattc 


aaaacaagaa 


taataaatag 


gatgaaaata 


ataatgataa 


aaagaacgat 


7920 


ttgtcttgtg 


tatcctctat 


tctgtttggc 


aagccccaca 


tgggccgaag 


agtcgccttg 


7980 


gacgtaccgt 


attggtatga 


ctaatgtagc 


tttcgatgct 


agcgcaaaag 


tatacttaaa 


8040 


tggtcagcgg gtgccaggag 


gaagcgctga 


tgcgagcgat 


aacaacgcgc 


ttacattcga 


8100 


25cttcggctac 


gccatcaacg 


accagtggaa 


tgtacgtgcg 


attgtcggta 


ttccgcctac 


8160 


aactaaagtg acgggcgcag 


gcacacttcc 


tggtatccag 


ctggggaaaa 


taacttacgc 


8220 


tccaacagta 


ttaacgttga 


actataacct 


ccccgctttg 


ggtcccgttc 


gccctcacat 


8280 


aggtgcggga 


gtcaattaca 


cgcggatttt 


tgaaagtcgg 


gacgctaatc 


taaaatcgtt 


8340 


cgatgccgac 


cacgcttggt 


cccccgcgct 


acatgttggt 


gccgatattg 


acgttaaccg 


8400 


SOtggttggttc gttagcattg 


atatccggaa 


gttatacctg 


aaaagcgacg 


catcagggta 


8460 


cttggggcca 


caggaggcta 


aagcacgggt 


aactcttgac 


ccattactaa 


cttcgatcgc 


8520 


gatcggacgc 


caattctgat 


gattctgttt 


aaagttcttt 


atctatctaa 


ccgcaaaggg 


8580 


tgtttccatg 


tcgaataaaa 


ttatgaaaac 


gtcgcgtctt 


accgccgaag 


atatcaacgg 


8640 


cgcctggact 


ataatgccca 


caccctcgac 


gcctgatgct 


tctgattggc 


gcagcactgc 


8700 


35caccgtggac 


ttagaagaga 


ctgcccgcat 


agttgaagag 


ctgattgcag 


ctggtgtcaa 


8760 


cggtattcta agtatgggta 


cttttggtga 


gtgcgccacg 


ttgacctggg 


atgaaaaacg 


8820 


tgattatgtc 


tcgacgattg 


tcgagaccat 


tcgtggtcgc 


gtgccttatt 


tctgtggcac 


8860 


gacagcctta 


aatacccgag 


aagtcatccg 


ccagacccga 


gagcttafccg 


atattggcgc 


8940 


caacggcact 


atgctcgggg 


tgccgatgtg 


ggtgaagatg 


gacctgccta 


cagcggttca 


9000 


40gttctatcgt gatgttgcag 


atgcggtacc 


agaggctgcc 


attgcgattt 


acgccaaccc 


9060 
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cgaagcattc 


aagttcgact 


tccctcgccc 


attctgggca 


gagatgtcca 


aaattccgca 


9120 


ggtagtgact 


gcgaagt:afcc 


taggcatcgg 


aatgcttgac 


ttggacctga 


gactggcacc 


9180 


caacatccgc 


ttccttcccc 


acgaagatga 


ctattacgcg 


gccgcacgca 


tcaatcccga 


9240 


gcgcataacc 


gcgttctggt 


caagcggggc 


catgtgcggc 


ccggctaccg 


ccatcatgtt 


9300 


5gcgtgacgaa 


gtggtgcggg 


ccaagagcac 


cggtgactgg gccaaggcca aagccatctc 


9360 


cgatgatatg 


cgtgcagccg 


actcgacatt 


gtttccgcgt 


ggcgactttt 


cggagttctc 


9420 


gaagtataat 


atcgggcttg 


aaaaggcacg 


gatggatgcg gctggttggc 


tcaaggctgg 


9480 


gccctgccgt 


ccgccctaca 


accttgttcc 


agaagactac 


ctcgctggtg 


cacagaaatc 


9540 


aggcaaggct 


tgggccgcgc 


tgcacgctaa 


atacagtaat 


gaattgaagt 


agttcacctc 


9600 


lOcgcagacctg 


agtgacaggg 


tggcgcagac 


gctgagggtg 


caggaattaa 


gtgagctaaa 


9660 


gcacatttct 


tgcgccaggc 


attgccagat 


cagcaaagtt 


tgctga 




9706 



15 <210> 6 

<211> 2294 
<212> DNA 

<213> Artificial Sequence 

20 <22Q> 

<223> A modified DNA sequence 

<221> unsure 
<222> (186) . . . (186) 
25 ^223 > n - a or t or g or c 

<400> 6 

cLgggcagckt ctgtatttgc ggcagcgaaa tgctccctaa attcctcatt taccccatct 60 

gaggattgct ttatgacagt aaagtggatt gaagcagtcg ctctttctga catccttgaa 12 0 

30 ggtgacgtcc tcggcgtgac tgtcgagggc aaggagctgg cgctgtatga agttgaaggc 180 

gaaatntacg ctaccgacaa cctgtgcacg catggttccg cccgcatgag tgatggttat 240 

ctcgagggta gagaaatcga atgccccttg catcaaggtc ggtttgacgt ttgcacaggc 300 

aaagccctgt gcgcacccgt gacacagaac atcaaaacat atccagtcaa gattgagaac 360 

ctgcgcgtaa tgattgattt gagctaagaa ttttaacagg aggcaccccg ggccctagag 420 

35 cgtaatcacc cccattccat ctttttfcagg tgaaaacatg aattacaata ataaaatctt 480 

ggtaagtgaa tttggtctga gccaaaagca cctgattcat ggcgatgaag aacttttcca 540 

acatgaactg aaaaccattt ttgcgcggaa ctggcttttt ctcactcatg atagcctgat 600 

tcctgccccc ggcgactatg ttaccgcaaa aatggggatt gacgaggtca tcgtctcccg 660 

gcagaacgac ggttcgattc gtgctt;ttct gaacgtttgc cggcatcgtg gcaagacgct 720 

40 ggtgagcgtg gaagccggca atgccaaagg ttttgtttgc agctatcacg gctggggctt 780 
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cggctccaac 


ggtgaactgc 


agagcgttcc 


atttgaaaaa 


gatctgtacg gcgagtcgct 


840 




caataaaaaa 


tgtcfggggt 


tgaaagaagt 


cgctcgcgtg 


gagagcttcc 


atggcttcat 


900 




ctacggttgc 


ttcgaccagg 


aggcccctcc 


tcttatggac 


tatctgggtg 


acgctgcttg 


960 




gtacctggaa 


cctatgttca 


agcattccgg 


cggtttagaa 


ctggtcggtc 


ctccaggcaa 


1020 


5 


ggttgtgatc 


aaggccaact 


ggaaggcacc 


cgcggaaaac 


tttgtgggag 


atgcatacca 


1080 




cgtgggttgg 


acgcacgcgt 


cttcgcttcg 


ctcgggggag 


tctatcttct 


cgtcgctcgc 


1140 




tggcaatgcg 


gcgctaccac 


ctgaaggcgc 


aggcttgcaa 


atgacctcca 


aatacggcag 


1200 




cggcatgggt 


gtgttgtggg 


acggatattc 


aggtgtgcat 


agcgcagact 


tggttccgga 


1260 




attgatggca 


ttcggaggcg 


caaagcagga 


aaggctgaac 


aaagaaattg gcgatgttcg 


1320 


10 


cgctcggatt 


tatcgcagcc 


accfccaactg caccgttttc ccgaacaaca gcatgctgac 


1380 




ctgctccggt 


gttttcaaag 


tatggaaccc gatcgacgca aacaccaccg aggtctggac 


1440 




ctacgccatt 


gtcgaaaaag 


acatgcctga ggatctcaag cgccgcttgg 


ccgactctgt 


1500 




tcagcgaacg 


gtcgggcctg 


ctggcttctg 


ggaaagcgac 


gacaatgaca 


atatggaaac 


1560 




agcttcgcaa 


aacggcaaga 


aatatcaatc 


aagagatagt 


gatctgcttt 


caaaccttgg 


1620 


15 


tttcggtgag 


gacgtatacg 


gcgacgcggt 


ctatccaggc 


gtcgtcggca 


aatcggcgat 


1680 




cggcgagacc 


agttatcgtg 


gtttctaccg ggcttaccga 


gcacacgtca 


gcagctccaa 


1740 




ctgggctgag 


ttcgagcatg 


cctctagtac 


ttggcatact gaacttacga agactactga 


1800 




tcgctaacag 


acgagtcgac 


catgatgatc 


aatattcaag 


aagacaagct 


ggtttccgcc 


1860 




cacgacgccg 


aagagattct 


tcgtttcttc 


aattgccacg 


actctgcttt 


gcaacaagaa 


1920 


20 


gccactacgc 


tgctgaccca 


ggaagcgcat 


ttgttggaca 


ttcaggctta 


ccgtgcttgg 


1980 




ttagagcact 


gcgtggggtc 


agaggtgcaa 


tatcaggtca 


tfctcacgcga 


actgcgcgca 


2040 




gcttcagagc 


gtcgttataa 


gctcaatgaa 


gccatgaacg 


tttacaacga 


aaattttcag 


2100 




caactgaaag 


ttcgagfctga gcatcaactg gatccgcaaa actggggcaa cagcccgaag 


2160 




ctgcgcttta 


ctcgctttat 


caccaacgtc 


caggccgcaa 


tggacgtaaa 


tgacaaagag 


2220 


25 


ctacttcaca 


tccgctccaa 


cgtcattctg caccgggcac gacgtggcaa 


tcaggtcgat 


2280 




gtcttctacg 


ccgc 










2294 



<210> 7 
<211> 4355 
30 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> A modified DNA sequence 

35 

<400> 7 

atcgataaaa ccatgagggt aataaataat gatgaaggag actatctaga attcgttctg 60 
gagacggtgc cggtggcata accgtagacc agcaggctga gcaaaatacc tcggggtgat 120 
aagccgcgct accgcgtccg ccatatgcct gagtgagtgc gctaaggtcg ggccgctcaa 180 
40 tggcttcgag aataaagcgg gccaggtgat cttctgtcaa tctcgccgaa tgggtggaag 240 
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cacatcaacc tgaccggcga ttacgtctgg cggcagagcc gcagactgga ggtcgggaag 300 

ttccggccgt tacggtzggcc ctgaaaacct tagtgtacga ttttttccgt cttctctatt 360 

cggcctgcgt tgcgacgttc gatctgacta gggcgcagcg gcaacatcgg ctgagtacga 4 20 

tccagcgcct ggatctgggt tatttcgtca accgacaaca ccagagcgtt atccggtggg 4 80 

5 ttcaggtaaa gcccaaccac attgacfcacc tctcggcaaa atgagggtcg ttactaattt 54 0 

tgaacgtctt caagcgatgg ggttttaggt cggaagctgc ccagacctgc ccaacctacc 600 

acgtcgtgac ccgagcatat tcggcccgcg gcggcaaaaa acacagaaat gagcgggtga 660 

cccgatcgcc tttgatcgat tctccgcttt caaaatgggc gggggctgaa gtcagccaga 720 

aatacatgac tacttcagac gattcgcagt attcacgctg gtgataaaca aattcaacta 780 

10 tgctttattg acaaataaaa gcacgctcac catcatcgcg aatacaaatc ttataaaaat 840 

taagccggat ttatggaact tctcatacag ccaaacaatc gcctcattag ctttagtccc 900 

ggcgccaacc ttctggaagt gcttcgcgaa aacggtgtcg ctatttccta cagttgtatg 960 

tctgggcgtt gcggaacctg ccgctgtcgg gttacagatg gcagtgtaat tgattcgggg 1020 

9C9ggaagcg ggttaccaaa cctcgtggac gagcattatg tgctcgcctg tcagtcagta 1080 

15 cttactcaca attgcgcgat cgaaatccca gaaaccgacg aaatcgtcac ccacccggcg 1140 

agaatcatca agggcactgt ggtcgccgtc gagtcgccca ctcacgatat ccgtcgccta 1200 

cgcgtacgcc tcgctaagcc cttcgagttc tcacccggac agtacgcgac attgcagttc 1260 

agtcctgagc atgcgcgtcc gtattcaatg gcaggtctgc cagatgacca agaaatggag 1320 

ttccacatac gcaaggtgcc gggtgggcgc gtaacggagt atgttttcga gcacgtccgc 1380 

20 gaaggtacaa gcatcaagtt gagcgggcca cttggtacgg cttatttgcg tcagaaccac 1440 

accgggccga tgctctgtgt gggcggtggg accggactag caccggtgct gtcgattgtt 1500 

cgcggcgcgc tgaagttggg tatgacaaac cccatcctcc tttatttcgg agtgcgcagt 1560 

cagcaagacc tctacgacgc agagcgattg cacaaactcg ccgctgatca ccctcaactg 1620 

accgtacaca cggtaatcgc aatgggcccg attaatgaga gtcagcgagc cggtctagtt 1680 

25 accgatgtga tcgaaaaaga catcatttcg ctggctgggt ggagggccta cctgtgcggc 1740 

gcaccagcga tggttgaagc gctttgcacc gttaccaagc atcttggaat atcacccgaa 1800 

catatttatg ccgatgcctt ctatcccggt ggaatctgaa tagtcccttt cccfcgcacct 1860 

ctgtccatcg aggactcatc aggaggatac tcaaataagc gcaaataata acagtcgcgt 1920 

cagtatttgc ggcagcgaaa tggtttccct ctccctcatt taccccatct gaggatagtt 198 0 

30 ttatgacaga aaaatggatt gaagcagtcg ccctttctga catcccagaa ggtgatgtcc 2040 

tcggcgtgac tgtcgagggc aaggagctgg cgttgtacga agtggaaggc gaaatctacg 2100 

ctaccgacaa cctgtgcacg catggtgccg cccgcatgag cgatggttat ctcgaggggc 2160 

gagaaatcga atgccccttg catcaaggtc ggtttgacgt ttgtacaggc agagccctct 2220 

gcgcccccgt gacagagaac atcaaaacat atgcagtcaa gattgagaac ctgcgcgtaa 2280 

35 tgattgattt gagcggagag ttttaatagg aggtaccccg gaccctagcg cgtaactacc 2340 

ccaattccat cttttttagg tgaaaacatg aattacaaaa acaaaatatt ggtgagtgaa 2400 

tctgggctga cccaaaagca cctgattcat ggcgatgaag aacttttcca gcacgaactg 2460 

agaaccattt ttgcgcggaa ctggcttttt ctcactcatg acagcctgat tccatccccc 2520 

ggcgactatg ttaccgcaaa aatgggtatt gacgaggtca tcgtctctcg gcagagcgac 2580 

40 ggttcgattc gtgccttcct gaacgtttgt cggcaccgtg gcaagacgct ggttaacgcg 2640 
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gaagccggca 


atgccaaagg 


tttcgtttgc 


agctatcacg 


gctggggctt 


cggctccaac 


2700 




ggtgaactgc 


agagcgttcc 


attcgaaaaa 


gagctgtacg 


gcgagtcgct 


caacaaaaaa 


2760 




tgtctggggt 


tgaaagaagt 


cgctcgcgtg 


gagagcttcc 


atggcttcat 


ctatggttgc 


2820 




ttcgatcagg 


aggcccctcc 


tcttatggac 


tatctgggtg 


acgctgcttg gtacctagag 


2880 


5 


CGcatcttca 


aacattcagg 


cggtttagaa 


ctggtcggtc 


ctccaggcaa 


ggttgtgatc 


2940 




aaggccaact 


ggaaggcacc 


cgcggaaaac 


tttgtggggg 


atgcatacca 


cgtgggttgg 


3000 




acgcacgcgt 


cttcgcttcg 


ctcgggagag 


tctatcttcg 


cgtcgctcgc 


tggcaacgca 


3060 




gtgctgcccc 


ctgaaggtgc 


aggcttgcaa 


atgacctcca 


aatacggcag 


cggcatgggt 


3120 




gtgttgtggg 


acggatattc 


aggcgtgcat 


agcgcagact 


tggttccgga gttgatggca 


3180 


10 


ttcggcggct 


ctaagcagga 


aaggctgaac 


aaagaaattg 


gcgatgttcg 


cgcccggatt 


3240 




tatcgcagcc 


acctcaactg 


caccgttttc 


ccgaacaaca gcatgctgac 


ctgctcgggt 


3300 




gttttcaaag 


tatggaaccc 


gatcgacgca 


aacaccaccg 


aggtctggac 


ctacgccatt 


3360 




gtcgaaaaag 


acatgcccga 


ggatctcaag 


cgccgcttgg ccgactcggt 


tcagcgtacg 


3420 




gttgggcctg 


ctggcttctg 


ggaaagcgac 


gacaatgaca 


atatggaaac 


agcgtcgcaa 


3480 


15 


aacggcaaga 


aatatcaatc 


cagagatagt 


gatctgcttt 


caaaccttgg 


tttcggtaag 


3540 




gacgtatacg 


gcgacgcggt 


ctatcctggc 


gtcgtcggca 


aatcggcgat 


cggcgagacc 


3600 




agttatcgtg 


gtttctaccg 


ggcttaccag 


gcacacgtca 


gcagctccaa 


ctgggctgag 


3660 




ttcgaggatg 


cctctagtac 


ttggcatacc 


gaactgacga 


agactactga 


tcgctaacag 


3720 




acgagtcgac 


catgatgatc 


aatattcaag 


aagacaagct 


ggtctccgcc 


cacgacgccc 


3780 


20 


aagagtttct 


tcgtttcttc 


aattgccacg 


acgcggcttt 


gcaacaagaa 


gccaccacgc 


3840 




tgctgaaccg 


ggaagcgcat 


ctgttggaca 


ttcaggctta 


ccgggcttgg 


ttagagcact 


3900 




gc9tggggtc 


agaggttcaa 


tatcaggtca 


tttcacgcga 


actgcgcgcc gcttccgagc 


3960 




gccgttataa 


gctcaatgaa 


gccatgaacg 


tttacaacga 


aaattttcag caactgaaag 


4020 




ttcgaatcga 


gcatcaactg 


gatccgcaaa 


actggagcaa cagcccgaag ctgcgcttta 


4080 


25 


ctcgcttcat 


caccaatgtc 


caggccgcaa 


gggacgtaga tgacgaagag ctacttcaca 


4140 




tccgctccaa 


cgtcattctg 


caccgggcac 


gacgtggcaa 


tcaggtcgat gtcttctacg 


4200 




ccgcccggga 


agacaaatgg 


aaacgtggcg 


aaggtggagt 


gcgaaaattg 


gtccagcgat 


4260 




tcgtggatta 


cccagagcgc 


atacttcaga 


cgcacaatct 


gatggtcttt 


ctgtgatcca 


4320 




gtgaccactt 


ttacaaatgg 


tgactgctac 


cgcgg 






4355 



30 



<210> 8 
<211> 2176 
<212> DNA 

<213> Artificial Sequence 

35 

<220> 

<223> A modified DNA sequence 



<400> 8 

40 gtgatgtcct cggcgtgact gtcgagggta aggatctggc actgtacgaa gtggaaggcg 



60 
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aaatctacgc 


taccgacgac 


ctgtgcacgc 




tcgaggggcg 


agaaa^tcgaa 


tgccccttgc 




gagccctctg 


cgcccccgtg 


acagagaaca 




tgcgcgtaat 


gattgattta 


agctgagaat 


5 


cgtaattatc 


cccattccat 


ctttttttag 




tggtaagtga 


gtctgggctg 


acccaaaagc 




agcacgaact 


gagaaccatt 


tktgcgcgga 




ttccttcccc 


cggcgactat 


gttaccgcaa 




ggcaaagcga 


cggttcgatt 


cgtgccttcc 


10 


tggtgaacgc 


ggaagccggc 


aatgctaaag 




tcggctccaa 


cggcgaactg 


cagagcgttc 




tcaacaaaaa 


atgtctgggg 


ttgaaagaag 




tctatggttg 


cttcgatcag 


gaggcccctt 




ggtacctgga 


acccatcttc 


aaacattcag 


15 


aggttgtgat 


caaggccaac 


tggaaggcac 




acgtcggttg 


gacgcacgcg 


tcttcgcttc 




ctggcaacgc 


agtgctgccg 


cctgaaggtg 




gcggtatggg 


tgtgttgtgg 


gacggatatt 




aattgatggc 


attcggcggc 


gctaagcagg 


20 


ccgcccggat 


ttatcgcagc 


cacctcaact 




cctgctcggg 


tgtbttcaaa 


gtatggaacc 




cctacgccat 


tgtcgaaaaa 


gacatgcccg 




ttcagcgaac 


ggtcgggcct 


gctggcttct 




cagcatcgca 


aaacggcaaa 


aaatatcaat 


25 


gtttcgggaa 


ggatgtatac 


ggcgacgcgg 




tcggcgagac 


cagttatcgt 


ggtttctacc 




actgggctga 


gttcgaggat 


gcctctagta 




atcgctaaca 


gacgagtcga 


ccatgatgat 




ccatgacgcc 


gaagagtttc 


ttcgtttctt 


30 


agccaccacg 


ctgctgaccc 






gttagagcac 


tgcgtggggt 


cagaggttca 




cgcttccgag 


cgccgttata 


agctcaatga 




gcaactgaaa 


gttcgagtcg 


agcatcaact 




gctgcgcttt 


actcgcttca 


tcaccaatgt 


35 


tctgcttcac 


gtccgctcca 


acgtcgttct 




tgtcttctac 


gccgcc 






<210> 


9 






<:2X1> 


14462 
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<212> 


DNA 





atggtgccgc 


ccgcatgagc 


gatggttatc 


120 


atcaaggtcg 


gtttgacgtt 


tgtacaggca 


180 


tcaaaacata 


tgcagtcaag 


attgagaacc 


240 


ttttaatagg 


aggcgccccg 


gaccatagag 


300 


gtgaaaacat 


gaattacaaa 


aacaaaatct 


360 


acctgattca 


tggcgatgaa 


gaacttttcc 


420 


actggctttt 


tctcactcat 


gacagcctga 


480 


aaatggggat 


tgacgaggtc 


atcgtctctc 


540 


tgaacgtttg 


ccggcaccgc 


ggcaagacac 


600 


gtttcgtttg 


cagttatcac 




660 


cattcgaaaa 


agagctgtac 


ggcgagtcgc 


720 


tcgctcgcgt 


a^gagagcttc 


catgggttca 


780 


ctcttatgga 


ctatctcggt 


gacgctgctt 


840 


gcggtttaga 


actggtaggt 


cctccaggta 


900 


ctgcggaaaa 


ctttgtgggt 


gatgcatacc 


960 


gcacaggcga 


atctatcttc 


tcgtcgctcg 


1020 


cgggcttgca 


aatgacctcc 


aaatacggca 


1080 


caggcgtgca 


tagcgcagac 


ttggttccgg 


1140 


aaaggctgaa 


caaagaaatt 


ggcgatgttc 


1200 


gcaccgtttt 


cccgaacaac 


agcgtgctga 


1260 


cgatcgacgc 


aaacaccacc 


gaggtctgga 


1320 


aggatctcaa 


gcgccgcttg 


gccgacgcgg 


1380 


gggaaagcga 


cgacaatgac 


aatatggaaa 


1440 


ccagagatag 


tgatctgatt 


tccaaccttg 


1500 


tctatcctgg 


cgtcgtcgga 


aaatcggcga 


1560 


gggcttacca 


ggcacacgtc 


agcagctcca 


1620 


cttggcatac 


cgaactgacg 


aagactactg 


1680 


caatactcaa 


gaagacaagc 


tggtatccgc 


1740 


caattgccac 


gactcggctt 


tgcaacaaga 


1800 


tctgctggac 


attcaggctt 


accggacttg 


1860 


atatcaagtc 


atttcacgcg 


aactgcgcgc 


1920 


agccatgaac 


gtttacaacg 


aaaatttcca 


1980 


ggattcacaa 


aactggagca 


acagcccgaa 


2040 


ccaggccgca 


atggacgtaa 


atgatgaaga 


2100 


gcaccgggca 


cgacgtggca 


atcaagtcga 


2160 
2176 
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<213> Artificial Sequence 
<220> 

<223> A modified DNA sequence 

5 



<400> 9 



gtcgactccg 


gtggccaaat 


cgacgggagc 


acaagcgtta 


cagcatcggg 


gctatcggcc 


60 


tcaaccgctg 


agacgcatct 


acatacccaa 


aagcaatggc 


aagaagcgtc 


cgctgggaat 


120 


tccaacgatg 


cgggatcgtg 


cgatgcaggc 


gctgtggaaa 


ctggcacttg 


agccggttgc 


180 


lOcgaaacgcgt 


gtagatccca 


actcctatgg 


atttcggccg 


cagcgatcca 


ctgccgatgc 


240 


aatcgcacat 


tgcttcaatg 


cactggcgaa 


acgtggttcg 


gcgcactggg 


tactggaagg 


300 


tgacattcga 


ggctgtttcg 


acaatatcag 


tcacgattgg 


ctgctcacca 


acgtgcccat 


360 


ggataaggtg 


gttctgcgca 


aatggcttcg 


agcggggtat 


gtcgatcagg 


gagccttgtt 


420 


cgcaacggag 


gcaggaaccc 


cgcaaggggg 


aatcatctct 


ccggtacttg 


cgaattggac 


480 


15gctggatggc 


ctggaagatg 


tcgtccatgc 


aagcgtggct 


tcgacagcgc 


gcaagcgtaa 


540 


gccattcaag 


atacacgtcg 


tacgatatgc 


cgatgacttc 


atcatcacag 


gggccacgca 


600 


agctgttctg 


caacatcagg 


ttcgtcctgc 


aattgaggcg 


ttcctgaaac 


agcgtggact 


660 


ggaactctct 


gatgaagaga 


ctcagataac 


gcatatctca 


caaggcttcg 


attttctggg 


720 


ccagaacgta 


cgcaagtacg 


ccggcaagct 


actcatcact 


ccggcttgca 


agagtgtgaa 


780 


20ggcactgctg 


gataaggtgc 


gggaaatcgc 


gaacgcaaac 


aaggcggcga 


ctcaagctaa 


840 


cttgatcttg 


accctgaacc 


cggtaatccg 


aggatgggcc 


atgtatcatc 


gccatgttgt 


900 


cgccgccaac 


gtttcgcatg gatcgaccat 


cagatttggc 


aagtgttatg 


gcgctgggca 


960 


gttcgtcggc 


acgccatgaa 


aaatgcctat 


tgggtaaaac 


aacgctactt 


tcgtgtcgta 


1020 


ggtcaacggc 


actgggtttt 


cgccactcag 


gagaaagcgc 


gcggcttgag 


tcaacccgct 


1080 


25tggctgtatg 


ccgccgccag 


tgtttcgatc 


gtgcggcata 


tcaaaatatg 


cagtgcagcg 


1140 


aacccgttcg 


atccggcatg gacgttttac 


cttgagcgtc 


gaagagcgca 


tcgtcaggtt 


1200 


gcccagtctt 


actcaggctg 


ctggaaggct 


tgagccgtat 


gaggggtgac 


tcccaagtac 


1260 


ggttcttagg 


ggagacaggg gtggtaacac 


ccctgtttta 


cccgactgaa 


agcgcatatc 


1320 


ggagccgacg 


cagaatcggg 


cctggtacat 


catgtccatg 


cacagccgcc 


aacgtggctg 


1380 


30acgtgacgga 


agttgcccat 


ctgttgcacg 


gtggcgagaa 


cgttgtctgt 


gcttggggtt 


1440 


acaccggcgt 


agagaaacgg 


cctgagcacg 


acggtcggca 


agtgatctgg 


cagatctgcg 


1500 


cgacgcagca 


cctacaagca 


tttgagcaaa 


cgcagtgcgc 


cgtacaaggc 


caggcgcaag 


1560 


atcgagaagg 


ccaaagccca ggtacgtgcg 


aaggtcgaac 


atccgtttcg 


ggtgatcaag 


1620 


cgtcagttcg 


gttatgtgaa gacccgcttc 


cgtgtcctgg 


ccaagaatac 


ggcggaattg 


1680 


35accacactgt 


tcgcgctgtc gaactctgtg 


gatggtgcgc 


cggcaattat 


tgcctgctgc 


1740 


gggagaggtg 


cgcccgtgag 


tgacagaaaa 


ccagggcttt 


gcctcggtta 


tccaataaca 


1800 


cagcgactga 


aaatcgggca 


tttcggcatc 


cctaagccgc 


ccatttccga 


ctgatgagca 


1860 


acttgttcgg 


agtttcccta 


aagcacaagc 


gcaagccatc 


agaaaagagt 


gcctgagtgt 


1920 


gctccaagat 


agcccgaaac 


taaggccgga 


catggaaaag 


accaagatca 


cccatgtgaa 


1980 


40tgatggactt 


gtttttgtgg gccatcggat 


cattcgcaag 


cgtagccgat 


acggagatat 


2040 



20 



gcgtgtgatc acgacgatcc gagacaaagc cagaagattc gcagcctcac tgacggcgct 2100 

gctatcaggt aaccaCagtg aaagtaagat cgacatggtt gaatcagtca accgaaagct 2160 

taaaggctgg gcggcgttta tcaattcgtt gattcaaagc caaagtattc agctatatcg 2220 

atcgtgtcgt gttttggaaa ctggcccatt ggctggctca aaaataccgc tcccaggttt 2280 

Sgtcttgcctg ttcctggccc gccgatgaac accacattct gcgcggtatc tgtgaactcc 2340 

aggttggacc gctccctatg ggcatcaaca atatctcgtg gccttggctg ggatacgcaa 2400 

ctaaccaaaa tgcgcgactg gcgcatgcag cttccatatg ggagacgctg gaagcgtcgg 2460 

aatagaccgt caatcagaag accttcttcg ctcccgtcga actgaaagcc tcaccgagtg 2520 

cgaacgtcaa cgggaaaggc gcgcaaagac ttgccttgct cacctcgtca cgagcgtaca 2580 

lOaacgctgtga gctggctgta cgggtgcgct gcttagcgtt ggtttttccg ttctgtgagc 2640 

caaccccgtg atatcgtagt gcaggaccat gccgttaagg agggctgcct ggtcagcatt 2700 

ttcagccagc gccggaagtt cctgcacttc ggccttgaac tggcgttcca atgcctggat 2760 

atggccatag ggaggtacgc tttggcgtaa ggcgttgcat gcagcttggt gctgtcgagg 2820 

ctgatggtgc ggagcttgag cagcttcatt tgccctgcca actccaccgc ctgaaggaag 2880 

ISttcccggcca agtcatcgag gaagcgctgc cggaagctgg ccggggtgac tcttgcggcg 294 0 

aggtggcgaa aagcgagcga gtcgtaactg gcgcgctcaa tcctggagct gggaactttg 3000 

ccggtggcat aaccatagac catcgggctg agcacaatgc ctcgggggtg ataagccgct 3060 

accgcgtccg ccatatgcct gagtgagggc gctaaggtcg ggccgctcaa tggcttcgag 312 0 

aataaagcgg gccaggtgat cttctgccat ctctcccgaa tgggtggaag cacatcaacc 3180 

20tgaccagcga ttacgtctgc gcagagccgc agactagagg tcgggaagtt ccggccgtta 3240 

cggctgcccg gaaaacctca gtgtacgatt ttttcagtct cctctattcg acctgcgttg 3300 

cgacgttcga tctgactagg gcgcagcggc aacatcggct gagtacgatc cagcgcctgg 3360 

atttgggtta tttcgttaac cgacaacacc agagcgttat cctgcgggtt caggcagagc 3420 

ccaaccacat gactacctct cggcaaaatg agggtcgtta ctaattttga acgtcttcaa 3480 

25gcgatggggt tttaggtcga aagctgccca ggcctgcgga acctaccaca tcgtaacccg 3540 

agcatattcg gcccgcggcg gctaaaaaca cagaaatgag cggggtgacc cgatcgcctt 3600 

tgatcgattc tccgctttca aaagcggcag gggctgaagt cagccggaaa tacccagacc 3660 

aatcacaaca ttcatgctgg tgataaataa attcaactat gctttattga caaataaaag 3720 

cacactcacc atcatcgcga atacaaatct tataaaatta agccggattt atgaaacttc 3780 

SOtcatacagcc aaacaatcgc ctcattagct ttagtcccgg cgccaacctt ctggaagtgc 3840 

ttcgcgagaa cggtgtcgct atttcctaca gttgtatgtc tgggcgttgc ggaacctgcc 3900 

ggtgccgggt tacagatggt agtgtaattg attcggggac gggaagcggg ttaccacacc 3960 

tcgtggacga gcattatgtg ctcgcctgtc ggtcagtact tactaacaat tgcgcgatcg 4020 

aaatcccaga agccgacgaa atcgtcaccc acccggcgag aatcatcaag ggcactgtgg 4080 

35tcgctattga gfccgcccact cacgatatcc gtcgcctacg cgtacgcctc gccaagccct 4140 

tcgagttctc acccggacag tacgcgacat tgcagttcag tcctgagcat gcgcgtccgt 4200 

attcaatggc aggtctgcca gatgaccaag aaatggagtt ccacatacgc aaggtgccgg 4260 

gtgggcgcgt aactgagtat gttttcgagc acgtccgcga aggtacaagc atcaaattga 4320 

gcgggccact tggtacggct tatttacgtc agaaccacac cgggccgatg ctctgtgtgg 4380 

40gcggtggaac cggactagca ccggtgctgt cgattattcg cggcgcgctg aagttgggta 4440 
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tgacaaaccc 


catcctcctt 


tatttcggag 


tgcgcagtca 


gcaagacctc 


tacgacgcag 


4500 


agcgattgca 


taacctcgcg 


gctgatcacc 


ctcaactgac 


cgtacacacg 


gtaatcgcaa 


4560 


tgggcccgat 


taatgagagt 


cagcgagccg 


gtctagttac 


cgatgcgatc 


gaaaaagaca 


4620 


tcagttcgct 


ggctgggtgg 


agggcctatc 


tgtgcggtgc 


accagcgatg 


gttgaagcgc 


4680 


Stttgcaccgt 


taccaaacat 


cttggaatat 


caccggaaca 


tatttatgcc 


gatgccttct 


4740 


atcccggtga 


aatctgaatc 


gtccctttcc 


ctcacctcgg 


tccattgagg 


actcatcagg 


4800 


aggatactca 


aataggcgca 


aataataaca 


gccgcgtcac 


tatgtgcggc 


agcgaaatgg 


4860 


tttccctctc 


cctcatttgc 


cccatcggag 


gatagtttta 


tgacagaaaa 


atggafctgaa 


4920 


gcagtcgccc 


tttgtgacat 


tccagaaggt 


gatgtcctcg 


gcgtgactgt 


cgagggtaag 


4980 


lOgatctggcac 


tgtacgaagt 


ggaaggcgaa 


atctacgcta 


ccgacgacct 


gtgcacgcat 


5040 


ggtgccgccc 


gcatgagcga 


tggttatctc 


gaggggcgag 


aaatcgaatg 


ccccttgcat 


5100 


caaggtcggt 


ttgacgtttg 


tacaagcaga 


gccctctgcg 


cccccgtgac 


agagaacatc 


5160 


aaaacatatg 


cagtcaagat 


tgagaacctg 


cgcgtaatga 


ttgatttaag 


ctgagaattt 


5220 


ttaataggcg 


gcgccccgga 


ccatagagcg 


tgattatccc 


cattccatct 


ttttttaggt 


5280 


15gaaaacatga 


attacaaaaa 


caaaatcttg 


gtaagtgagt 


ctgggctgac 


ccaaaagcac 


5340 


ctgattcatg 


gcggtgaagg 


gcttttccag 


cacgaactga 


gagccgtttt 


tgcgcggaac 


5400 


tggctttttc 


tcactcatga 


cagcctgatt 


ccttcccccg 


gcgactatgt 


taccgcaaaa 


5460 


atggggattg 


acgaggtcat 


cgtctctcgg 


caaagcgacg 


gttcgattcg 


tgccttcctg 


5520 


aacgtttgcc 


ggcaccgcgg 


caagacactg 


gtgaacgcgg 


aagccggcaa 


tgctaaaggt 


5580 


20ttcgtttgca 


gttatcacgg 


ctggggcttc 


ggctccaacg 


gcgaactgca 


gagcgttcca 


5640 


ttcgaaaaag 


agctgtacgg 


cgagtcgctc 


aacaaaaaat 


gtctggggtt 


gaaagaagtc 


5700 


gctcgcgtag 


agagcttcca 


tgggttcatc 


tatgcctgca 


tcgatcagga 


ggccccttct 


5760 


cttatggact 


atctcggtga 


cgctgcttgg 


tacctggaac 


ccatcttcaa 


acattcaggc 


5820 


ggtttagaac 


tggtaggccc 


tccaggtaag 


gttgtgatca 


aggccaactg 


gaaggcacct 


5880 


25gcggaaaact 


ttgtgggtga 


tgcataccac 


gtcggttgga 


cgcacgcgtc 


ttcgctctgc 


5940 


acaggcgagt 


ctatcttctc 


gtcgctcgct 


ggcaacgcag 


tgctgccgcc 


tgaaggtgcg 


6000 


ggcttgcaaa 


tgacctccaa 


atacggcagc 


ggtatgggtg 


tgttgtggga 


C99^tattca 


6060 


ggcgtgcata 


gcgcagactt 


ggttccggaa 


ttgatggcat 


tcggcggcgc 


taagcaggaa 


6120 


aggctgaaca 


aagaaattgg 


cgatgttcgc 


gcccggattt 


atcgcagcca 


cctcaactgc 


6180 


30accgttttcc 


cgaacaacag 


cgtgctgacc 


tgctcgggtg 


ttttcaaagt 


atggaacccg 


6240 


atcgacgcaa 


acaccaccga 


ggtctggacc 


tacgccattg 


tcgaaaaaga 


catgcccgag 


6300 


gatctcaagc 


gccgcttggc 


cgacgcggtt 


cagcgaacgg 


tcgggcctgc 


tggcttctgg 


6360 


gaaagcgacg 


acaatgacaa 


tatggaaaca 


gcatcgcaaa 


acggcaaaaa 


atatcaatcc 


6420 


agagatagtg 


atctgatttc 


caaccttggt 


ttcgggaagg 


atgtatacgg 


cgacgcggtc 


6480 


35tatcctggcg 


tcgtcggaaa 


atcggcgatc 


ggcgagacca 


gttatcgtgg 


tttctaccgg 


6540 


gcttaccagg 


cacacgtcag 


cagctccaac 


tgggctgagt 


tcgaggatgc 


ctctagtact 


6600 


tggcataccg 


aactgacgaa 


gactactgat 


cgctaacaga 


cgagtcgacc 


atgatgatca 


6660 


atactcaaga 


agacaagctg 


gtatccgccc 


atgacgccga 


agagtttctt 


cgtttcttca 


6720 


attgccacga 


ctcggctttg 


caacaagaag 


ccaccacgct 


gctgacccgg 


gaagcgcatc 


6780 


40tgctggacat 


tcaggcttac 


cggacttggt 


tagagcactg 


cgtggggtca 


gaggttcaat 


6840 
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atcaagtcat ttcacgcgaa ctgcgcgccg cttccgagcg acgttataag ctcaatgaag 6900 

ccatgaacgt ttgcgacgaa aatttccagc aactgaaagt tcgagtcgag catcaactgg 6960 

attcacaaaa ctggagcaac agcccgaagc tgcgctttac tcgcttcatc accaatgtcc 7020 

aggccgcaat cgagctaaat gatgaagatc tgcttcacgt ccgctccaac gtcgttctgc 7080 

Saccgggcacg acgtggcaat caagtcgatg tcttctacgc cgcccgggaa gacaaatgga 7140 

aacgtggcga aggtggagtg cgaaaattgg tgcagcgatt cgtggattac ccagagcgca 72 0 0 

tacttcagac gcacaatctg atggtctttc tgtgatccgg tgaccacttt tacaaatggt 7260 

gactgctacc gcggtcacca ttaatcaaaa gggaatgtac gtgtatgggc aatcaacaag 7320 

tcgtttcaat aaccggtgcc ggctcaggaa tcggtctcga actggttcga tccttcaagt 7380 

lOcggccggtta ttgcgtatcc gctctcgtac gaaacgagga gcaagaggcg cttctttgca 744 0 

atgaattcaa ggacgcactc gagatcgttg tgggcgatgt ccgagatcac gcaataaatg 7500 

agaagctgat caagcagaca atcgctagat tcggtcatct cgattgtttc atcgcaaatg 7560 

ccggtatttg ggattacatg ctgagcatcg aagagccttg ggagaaaatc tccagcagtt 7620 

ttgacgaaat attcgacatc aatgtaaaga gctatttcag tggcatcagt gcagctctgc 7680 

IScggaactgaa aaagacgaac ggatcggtgg tgatgaccgc ttcggtgtcg tcccatgcgg 7740 

tcggtggtgg tggttcttgc tacatcgcca gcaagcatgc ggtgttaggt atggtcaagg 7800 

ctttggccta cgaattggct cccgaaattc gcgtgaacgc tgtctcgccg ggcggcaccg 7860 

tgacgtctct gtgcggtcct gcaagcgccg gtttcgacaa aatgcacatg aaagacatgc 7920 

ccggcatcga cgatatgatc aaaggcctca ctcctcttgg gtttgcagcc aagcccgaag 7 980 

20acgtggtgga gccctatctg ttgctggctt cgcgaaagca gggaaaattc atcaccggca 8 04 0 

ccgtgattag cattgatggc ggtatggcgc tcggtcgcaa gtgagcttgc agccgatcaa 8100 

aggttataga aacattttta ggtgacgccc catgaagaca aaactgttta tcaacaacgc 8160 

ctggatcgat tccagtgacc agcagacctt cgagcgcaag caccccgtca acagcgaggt 8220 

gatgactgag agcgcaaacg ccacggtgac ggacgcgata aaggcggcgc aagtggccga 82 80 

25ggaggcattc aagacctgga aggacgttgg accttcggag cgtcgccgcc ttctcctgaa 8340 

ggtcgccgat gtcatggaaa gtaaaacacc caagtttatc gaagtgatgg ccatggaggt 8400 . 

gggagcttcc gctctttggg ccggattcaa cgtccatgcg tctgccaatg tgttccggga 8460 

ggctgcctcg ctggccactc aaattcaggg cgaaaccatc ccaacggaca aagccgaaac 8520 

gctctcaatg acactacgtc agccggtcgg cccgatcctg agcatcgtcc catggaacgg 85 80 

30caccgcagtg ctggcggcac gagccatcgc gtatccgctg gtctgtggca acacggtggt 8640 

gttcaaaggc tctgaattta gtcccgcgac gcatgccctg atcacccagt gcgtccagga 8700 

agccgggctg cccgctggcg tgctcaacta tctcaactcc tcgcctgacc gttcgcccga 8760 

gatcgccgac gcactgatct ccgcgaagga gatccgccgc atcaacttca cgggttcaac 8820 

ccgcgtgggc agcattatcg cgcagaaggc cgcgcaacac ctcaagcgct gcctactgga 8880 

35gctcggcggG aagtccccgc ttattgttct ggatgacgca aacattgacg cggcggtcaa 8940 

ggcagcggtg ttcggtagct tcctgttcca aggtcagatc tgcatgtcca ctgagcgctt 9000 

ggtggttgat gagaagattg ccgacgaatt tgtcgccaag tttgtcgaaa aaactaagcg 9060 
cttgagcgtg ggcgacccgt gcgtaactgg cgactgcatc atcggcccaa tggtctcgcc 9120 
aaatccgggc gagcggatca atggtttgtt caaggacgcg atcgataaag gggccaaagt 9180 

40tgtttgcggc ggcatggccc agggtgcggt catgccggcc acgatcctgg atcacgtgaa 9240 
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atctgacatg 
caaaggggaa 
cgtgtttggc 
cgtacacatc 
Sgaacaccggc 
gtggctgacc 
aatcaaacga 
aaggatcctg 
gagggtgaga 
iOcatcacggcg 
ttcgaagctt 
gttgaggctc 
ccgaccgaga 
cctctgcacg 
15accgacgtcg 
taccgcattc 
cgtgatcact 
cttgagtaca 
attgacatcg 
20gcaacgcctt 
gcggagtatt 
gatgtaaaac 
ctatacagcg 
ttctttacat 
25ggacataccg 
gaggccagcg 
acttcggcta 
caactaaagt 
ctccaacatt 
30tcggtgcagg 
tcgatgccga 
gcaactggtt 
acttagggcc 
caatcggacg 
35gtatttttat 
gtgcctggac 
acactgtgga 
acggtatttt 
gtgattatgt 
40cgacggccct 



cggatctacg 
gcagaggcca 
cgcgacatca 
aacggctcga 
tacgggcgct 
attgagccat 
tgagtaagca 
atgcgtggaa 
aggaccgttt 
gagaggacga 
tggggcaaaa 
aggagcgtat 
tattctgggg 
gcaagtttgt 
ctgcggccca 
cattgcccaa 
ccattgcgtt 
cccatatgga 
ccttgcagct 
cgggctggct 
acgtcggcga 
tgagttaaaa 
caataacaat 
gcctcgcttt 
catcggcatg 
tgtgccggga 
tgccatcaac 
taagggcgca 
actgacgtta 
agtcaattac 
ccacgcttgg 
cgtcagcatt 
acaggaggct 
ccaattttga 
gttgaataaa 
tataatgccc 
cttagacgag 
gagtatgggt 
ttcgacggtt 
gaatacccga 



atgaggagac 
tccgcattgc 
accgtgctct 
ccgtccagaa 
tcgacggccg 
tcgagcagca 
agctgcagtt 
atcgtttgcc 
ctatctgcgg 
cttggaatat 
gctcattgat 
ggtgttgggc 
gccccggatc 
gaccggtgac 
taaattctac 
cggcatgact 
tggtgccatg 
agacttggga 
tggcattcac 
tatcgagccc 
catcttcggc 
gcgattatgc 
aacaactaag 
acttgttcac 
agtaatattg 
ggaagcgctg 
gaccaatgga 
ggcacacttc 
aactataacc 
acgcgaattt 
tcccccgcgc 
gatattcgga 
aaagcaaagg 
tgcccccttt 
attagtaaaa 
acaccgtcga 
actgcccgca 
acctttggtg 
gtcgagacca 
gaagtcattc 



ctttggtccc 
caacgacagc 
gcgagtgggt 
cgaggcgcag 
tgccgtggtc 
atatcccttc 
atcgagctcg 
atgaatatgc 
atggattact 
ctaggctggc 
gccggttaca 
ctgatgaaga 
gacatgagca 
cagggcttag 
agcctgctgg 
gccgaactgt 
cctgctgcca 
tatacgcatc 
gccaacgaca 
ggctggcgag 
catggcgtcg 
gtacatcggg 
tgggaatatt 
tcagcaccgc 
cttttgatac 
acgcgagcaa 
atgcacgttt 
cgggcattca 
tcccagcttt 
tggaaagcaa 
tgcatgttgg 
agttatacct 
taactcttga 
taaggctctc 
ccgcgcgtct 
cgcctgatgc 
tagttgaaga 
agtgcgccac 
ttcgcggtcg 
ggcagacccg 



atcacagtgg 
gtttatggtc 
atgtcgatcg 
gctccttacg 
gacgagttca 
tgaaagcact 
gatacatggg 
tgggtctgca 
ggcaccatcg 
gtgtagccgg 
agatccgtgt 
cagaagatcc 
atccgttcca 
gtcattgcat 
gcttccgtgg 
cattcatgca 
agcggctcaa 
aacagttcgt 
aggcgttgac 
gtgctacggc 
aggcacctgg 
tttctgacgt 
aataatgata 
atgggccgaa 
aagcgccaaa 
taacaacgcg 
gattgtcggt 
gctgggaaaa 
tgggcccgtt 
ggacgctaat 
tgccgatatt 
gaaaaccgac 
cccattgata 
tatctatcta 
taccgctgaa 
ttctgattgg 
gctgattgct 
gttgacctgg 
tgtaccttat 
agagcttatc 



tgatccgttg 

tgtcgtcggg 

aatatggttg 

gaggtaccaa 

cagagctcaa 

aactccaagg 

catctcagtc 

agtactcgat 

tatcgtagtt 

caagccggag 

ctgcgacaaa 

aggcggcaac 

tcccggccgc 

cgttcgccaa 

ggacgtcgaa 

ttgcaacgcc 

tcacctgatg 

gaagaacgaa 

gttctacggc 

catagacgaa 

atatggcctg 

tttctgcttt 

aaaaaaacgt 

gaatcacctt 

gtgtacttgg 

ctcacattcg 

attccaccta 

ataacttatg 

cgtcctcaca 

ctaaaatcat 

gacgtgagtc 

gcatcaggtt 

acatcgatcg 

actgcaaagg 

gatatcaatg 

cgcacgacta 

gctggtgtca 

gaggagaaac 

ttctgcggca 

gatattggcg 



9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
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ctaacggcac catgctaggc gtgccgatgt gggttaagat ggacctgccc acagcggtcc 11700 

agttctatcg tgatgttgca ggcgcggtac cggaggctgc cattgcgatt tacgccaacc 11760 

ccgaagcatt caaattcgac ttccctcgcc cattttgggc agagatgtct aaaattcctc 11820 

aggtagtgac tgccaagtat ctaggcatcg gaatgcttga cttggacctg aaattggcgc 11880 

Sctaacatccg cttccttcca cacgaggacg actattacgc ggccgcacgc atcaatcccg 11940 

agcgcattac tgcgttctgg tcaagcgggg ccatgtgcgg cccggctacc gctatcgcgt 12000 

tgcgtgatga agtggagcgg gccaagagta ccggtgactg gatcaaggcc aaagccatct 12060 

ccgatgatat gcgtgcagcc gattcgacat tgtttccgcg tggcgacttt tcggagttct 12120 

cgaagtataa tatcgggctt gaaaaggcac ggatggacgc ggctggttgg ctcaaggctg 12180 

lOgtccctgccg tcctccctac aatcttgttc cagaagatta cctcgttggt gcacagaaat 12240 

caggcaagtc gtgggccgcg ctgcacgcta aatacagtaa agaattaaag tagttcacct 12300 

ccgcatgcct gagcgcgagg ggtgacgtaa acgccgagcg gtgcgggaag taagtgagtt 12360 

agagttcatt tcttgtgcca ggcactgcta gatcagcaaa gttagctgat ctagcagtct 12420 

cgaaaatttg ggcgaaagct gatcttagga atgcgggata aaggcagtac accgtaacga 12480 

I5tcggggtgtg ccgttcatgt tgaacgacac cgctattgcg ccgacttctc ttcttcggag 1254 0 

tgtttgattg tgattgtcga tttctatttc gattttttga gtccgttctc ttacttggcc 12600 

aatcagcgtt tgtcaaagct tgcgcaagat catggcctta ccacgtgtta taacgcgatc 12660 

gatttggcgc gggtcaaaat agcgatcggt aacgttggtc catctaatcg cgacttggaa 12720 

gtaaaattgg actatttgaa agtagatttg caacggtggg cccagcttta cggaataccg 12780 

20ctggtatttG cagctaacta caacagcaga cggatgaata ctgggcttta ttactcggag 12840 

gccgaggtgc aggccgctgc ctatgtgaat gtagtattta atgcgatttg gggagaaggc 12900 

atagcaccag atttggaaag cttgcctgcc ctggtatctg aaaagctagg ctgggatcgt 12960 

agtgccttcg agcgctttct cagcagcaac gccgcaacag agaggtatga cgagcagaca 13 020 

catttcgcca tcgagcgcaa ggtgttcggt gtgccaacga tgtttttggg cgatgaaatg 13080 

25tggtggggga acgaccgtct gtttatgctc gagagcgcaa tggggcgctt gtgccggaaa 13140 

aatgccgatt taagtagttg atctgatcgt tatttgctcg atgagtcgct ttcaagatca 13^00 

gcggatactg aagtcagcta aatgcgggac tacttcaggc catgcttcag gggcggtcag 13260 

ggcggactgg cgtgtgcacg gtagttgttg cgtgaggcat gtttttaaga tactattttc 13320 

agtgcctgct gctctctctg ggttgagtgt gagggcgccg atagtctcaa atctcgcgtt 13380 

30gtacctgacc agagatacgg gtgtttatga tggcatcgct ggtacagcgc atgacgccat 13440 

gctgcttcac ttcgagtgta ttcacgacct cgacgctagc aatctcaacg agccagatga 13500 

tcgccagcca gggcggaagc tagtcgtgat tcgggttttc tacgttgtac ccgaatgcaa 13560 

tcctggtctt aacggtccac gccttaaacg tggaatgcca aatctgcgat gtcgacattg 13620 

gcccatataa tgttttggct ctcctctagt cagaaatcag taaggtgtgc atggtgataa 13680 

35ctggtcaagc cgggaatcag cggaatcgct gcgctggcgc cactggcgat caaacagcgg 13740 

tcaaatacta ccattgaact ggcctaagag ggtgtggaca aaataaagta aacttcttgc 13800 

ctcctttcta tgcagcattg agccatgcct aaaactggac gcccaccgta atcgctacgc 13860 

accattaccc tctgctgacc aggctcgctc atacgcagcc ttattccagt caagccgaac 13920 

tggcgcaaga ttccatgcag aaacgggcat caccgcgcat cccgacacct ttgcaaaagc 13980 

40gctgagactg gccgggatcg ttcgagtaaa ggagcgtgcc aaaggtagct tccagcctcc 14040 
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cgagcctcgt aagtcttatg 
ctatcccagt tgcctgactg 
999^9gtcgc ggcgtaccgc 
gtcgtgcgca cagggtgctc 
Sgtctataaaa cctttcggcg 
tccgtgccca atggcgagag 
attcccaatc aacgcgcacg 

ag 

10 

<210> 10 
<21X> 12808 
<ai2> DNA 
15 <213> Artifici* 

<220> 

<223> A modified DNA sequence 
20 <400> 10 

gtcgacgcat gatgcaggac tgggccgatc gcctcgacct cttcgagcag aaccaggtcg 60 

ag9C99ccag catgccgctc accgtgcatc tggaaggtgt gcccgcattc ccgagtgagc 120 

aaaccgcaag cgcgccctct acgccggttg ccgcttcgcc aagcctgctc gtgacgaagc 180 

cgggtgacgc catgccgttg gtttctgccg ccgcacatcg gctgccggca gtgccgcccc 240 

25agcgatcggc cgcgccgctg gtgccttcgg acattcagcg cgagagaatg gatttgttcg 300 

atgtcttcga agcgccgcac aaccttcccg tcgctgcgtt tgccaagatg gcgggcaaat 360 

ctcgcaggtg gatcagctac gagatccagg cgggcaactt gctggcgttg aacgtgggca 420 

accgcggcca gcgcgtgccg gactggcacc tcgacccgct caagcacgag ctgatccagt 4 80 

ctgtcctgaa gctgaccagg ggtgcggacc cttggcagat ctaccatgca ctgctgcagc 540 

BOcgcgctcgat gctgcggggg cgctcggcac tggagggcgt gactgccagc aatctcgaca 600 

agctcgtcat ggccgtgagc acagcggtga aggaaagcga atggaccccg ctgcgggtcg 660 

gtggtcgtgt agtcaaacaa cgggccggta agggatggcc tggacafctat tcatattagt 720 

gatactaata ttcatttatg gtttattgac tattagaata cagtcaatga tcatggaacc 780 

ataaagcata taaataaaga agctagatct atggaacttc tcgtactacc gaacaatcgc 840 

35cgcttgcctt ttgattccgg tgccaacctt ttggaagtgc tccgtgagca ccgtgtgggt 900 

atttcctaca gctgtatgtc tggacgatgc ggtacttgcc gctgccgagt tatagatggc 960 

agcgtcatta gttcggcggc gaaaagcggt gactcaaatc gcatcgaaga gcattatgta 1020 

ctcgcctgtc agtcagtgct caccagcaat tgcgcaattg agatcataga ctcagacgac 1080 

atagtcactc acccggcgcg aatcatcaaa ggcatggttg tcgccgtcga gtcgcccact 1140 

40cacgatattc gccgcatccg cattcgcctc gccaagccct tcgagttctc acccggacag 1200 



gctataccga ggcacaccgg cgtcagttgc cggagcaacg 14100 

atgcagaatg gacgttggtc gctgatttat ttgaagtctc 14160 

tcgccactcc cgacgcactc ttttggaagc ctgctgttat 14220 

ctggcgaatg ctgcttcgcg agtttcccta ttgggacaat 14280 

ctggagcgtc agggcaagtt tgagcaaatg cacgaccggc 14340 

cgggtggatc gtgatgaacg gccgtcagcc gctgttctgg 14400 

tctccccaag gcggtgaaag cggttacgac gcaggcaaaa 14460 

14462 



al Sequence 
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tacgcgatgc tacagttcag tcccgaacat gtgcgtccat attcaatggc tggtctgcca 1260 

gatgaccaag aaatggagtt ccatatccgc aaagtgccgg gcgggcgtgt cacggagtat 1320 

attttcgagc acgtccgcga aggtacaagc attaagttga gtgggcctct tggtacggcc 1380 

tatctgcgtc aggctcacac cgggccgatg ctgtgtgtgg gcggcgggac cggactcgca 1440 

Sccggtgctgt cgattgttcg cggcgcgctg aagtcgggaa tgacgaaccc catccacctt 1500 

tatttcgggg tgcgcagtca gcaagacctt tacgacgcag accgattgaa ccaactcgcg 15 60 

gctatccacc ctcaactgac tgtccataca gtgatcgcga cgggcccgat taatgagggc 1620 

cagcgggccg gcctaattac cgatttgatt gaaaaagaca ttccctcgct ggctgggtgg 1680 

agagcctacc tgtgcggcgc accagcgatg gttgacgctc tatgcaccgt cgccaaagafc 1740 

lOcttggaatat cgcccgagca tatttatgcc gacgccttct atcccagcgg ggtctgaatt 1800 

gccccgaccc ttcacctctg tacatcgaga attcatcagg aagacactta aatgagcatc 1860 

actaacaaca gccgcgtctg aatatttagg acagctggat gatctctaac tccatcatta 1920 

ccccatttga agatagcttt atgacagaaa aatggattga cgcagtcgct ctttatgaaa 1980 

tccctgaagg tgacgtcctc ggcgtgacag tcgaaggtaa ggaactagcg ctgtatgaag 2040 

15tggaaggcga aatctacgct accgacaacc tgtgcacaca tggtgctgcc cgcatgagtg 2100 

atggctttct agaaggcaga gaaattgaat gtcctttgca tcaaggtaga tttgatgttt 2160 

gcacaggcag ggccttgtgc gcccctgtga cacagaacat caaaacatac ccggtgaaga 2220 

ttgagggcca gcgtgtgatg attgatttga gctgagaatt ttaataggag gcaccccgga 228 0 

ccctagagcg taatcccccc cattcgatct cttgaggtga aaatatgaat tacaaaaata 2340 

20aaaacttggt gagtgaatct gggctgaccc aaaaacacct gattcatggc gacgaagaac 2400 

ttttccagcg cgaactggaa accatttttg ctcggaactg gcttttcctg actcatgaca 2460 

gcctgattcc gtcccctggc gactatgtta cggcaaaaat gggggttgat gaggttatcg 2520 

tctccaggca gaacgacggt tcgattcgtg cttttctgaa cgtttgtcgt caccgtggca 2580 

agacgctggt acacgcagaa gcaggtaatg ctaaaggttt cgtttgcagc tatcacggct 264 0 

25ggggcttcgg cgctaacggt gaactgcaga gcgtcccgtt tgaaaaagaa ctgtatggcg 2700 

aggcgctcga caagaaatgt atgggattga aagaagtcgc tcgtgtagag agcttccatg , 2760 

gcttcatcta tggttgcttc gatgaggaag ccccttctct caaagactac atgggggacg 2820 

ctggctggta cctggagcct atgtttaagc attccggagg gctagaactg atcggtcctc 2880 

caggaaaggt cataatcaag gctaactgga aagcgcccgc ggaaaacttt acgggggatg 294 0 

30cgtaccacgt gggttggacg catgcgtctt cgcttcgctc agggcagtcg gtcttctcgt 3000 

cgttagctgg caacgcagct ttgcccccag aaggtgcagg tctgcaaatg acctccaaat 3060 

acggcagcgg catgggtgtg ttgtgggacg gatattcagg cgtgcacagc gcagacctgg 3120 

ttccggaatt gatggccttc ggcggtgcta agcaggaacg gctgaacaaa gaaattggcg 3180 

aggttcgcgc acgaatctat cgcagccacc tcaactgcac cgttttcccg aacaacagtt 3240 

35ttctgacctg ctcgggtgtc ttcaaggtat ggcacccgat cgacgcaaat accactgagg 3300 

tatggaccta cgccatggtc gaaaaagaca tgcccgagga tctcaagcgc cgcttggtcg 3360 

acgcggttca gagaacggtt gggcctgctg gcttctggga aagcgacgac aacgacaata 3420 

tggaaacggt atcgcaaaac gccaagaaat atcagtccag agatggcgat ctggtttcca 3480 

acctgggttt cggcggggac gtatacggcg acgaggttta tcctggcatc gtcggcaaat 3540 

40cggcgattgg cgagaccagt tatcgtggct tctatcgggc ttacggcgcg cacatcagca 3600 
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gctctagctg ggctgaattc gaggatgtct ctaaaaattg gcataccgaa ctggcaaaga 3660 

ctactgatcg ctaacdgacg agagggacca tgatgattaa tattcaggaa gacaagcttg 3720 

tctccgccca cgacgccgaa gagtttcttc gtttcttcaa ttccggcgac gaggctttgc 3780 

aacaagaagc taccacgttg ctaacccggg aagcgcatct tttagacatt caggcttacc 3 840 

Sgcgcctggtt agagcactgc gtggactcag aggtgaaata tcagattatc tcacgcgaac 3900 

tgcgctcagc ttccgagcgc cgttaccagc tcaatgaaac catgaacatt ttcaacgaga 3960 

attatgaaca actggaagtt cgcgtagcgc atcaactgga tccgcaaaac tggggcaata 4020 

gtccaaaggt gcgctttact cgtttcatca caaatatcca ggctgcaatg gacgaaaatg 4080 

aagatttgct tcacattcgc tccaacctaa ttgttcaccg agcacgacgc ggcaatcaag 4140 

iOtcgatgtctt ctatgccact cgggaggata aatggaagcg cggcgaagat ggagcgcgta 4200 

3^9ttggtcca acgattgatt gattatccag agcgcacatt ccagacgcac aatgtgatga 4260 

tctttatgtg acccaataat cgcctttaca aatggtgact gctacaagcg gtcccattgt 4320 

tcaaaaggaa atttatgtgt atgagcaatc aacaagtcgt ttcgataacc ggtgctggct 4380 

caggaattgg tctcgaactg gttcgatcct ttaagtccgc cggttattgc gtatccgctc 444 0 

IStcgtacaaaa cgaggagcaa aaggcgagcc tttgcaatga gttcaaggac gcactcgaga 4500 

tcgtcgtggg cgatgtccgg gaccacgcaa caaatgagaa gctgataaag caaacaaccg 4560 

atagattcgg ccatctcgat tgtttcattg caaatgccgg tatttgggat tacatgcttg 4620 

gcatcgaaga gccttgggag aaaatatcga gcagttttga tgagatattc aacatcaatg 4680 

tcaagagcta tttcagcggt atcagggccg ccctgcagga actgaaaaag actagcggat 4740 

20cagtggtgat gaccgcttca gtgtcgtccc atgcggtcgg tgctggtggt tcttgctaca 4800 

tcgccagcaa gcatgcggtc ctgggcatga tgaaagcttt ggcttacgaa ttggctcccc 4860 

acattcgcgt caacgccgta gcaccgggcg gcactgtgac gcctctgagc ggtcccgcaa 4920 

gcgccggctt cgacaaaact cacatggaaa acatgcccgg tatcgaggac atgatcaagg 4980 

gtctaacgcc tcttggaatt tcagccaagg ccgaagacgt agtggcaccc tatttgttgt 5040 

25tggcgtcgcg agatcaaggg aaattcatta ccgggactgt cattaatata gatggaggga 5100 

tggcgctcgg tcgcaagtag gtttgtcgcc tatcttgaaa taataactaa atttctggta 5160 

aaaccgcatg aatacaaaat tgtttatcaa caatgtctgg atcaattcca gtgaccaaca 5220 

gaccttcgag cgaaagcacc ccgtcagtgg tgaggtgatg acggagtgtg caaactccac 5280 

ggtgatggat gcgttaaagg ccgcgcaagc tgcccaagag gctttccaga cctggaagac 5340 

30tgttggacct tcggagcgtc gccgccttct gctgagggtc gctgaggtta tggaaagtaa 5400 

aacacccgag tttatcgaag tgatggccaa ggaggtggga gcctccgctc tttgggccgg 5460 

cttcaatgtc cagatgtcag ccaatgtgtt ccgtgaagcg gcatcgctgg ctacacaaat 552 0 

tcagggggaa actattccga cagacaagtc tgacacgctc tcaatgacgc tacgtcagcc 5580 

ggtcggtccg atcctgagca tcgtgccgtg gaacggcacc gcagtgctgg cggcacgagc 5640 

35catcgcttat ccgctggtct gcggcaacgc ggtggtattc aaaggttctg agtttagtcc 5700 

cgcgacgcat gccctgatca cccagtgcgt gcaggaagcc gggctgcctg ctggcgtgct 5760 

caactatctc aactcttcgc ctgaccgttc gcccgagatc gccgacgcac tgatctcagc 5820 

caaggagatc cgacgcatca acttcacggg ctccacccgc gtgggcagta ttatcgcgca 5880 

gaaggccgcg caacacctca agcgctgcct gctggagctc ggtggcaagt ccccacttat 5940 

40tgttctggat gatgcagaca tcgatgcggc ggtcaaggca gcggtgttcg gtagcttcct 6000 
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gttccaaggt cagatctgca tgtccactga gcgcttgatc gttgatgaga agatagccga 6060 

cgaatttgtc gcaaaa*tttg tcgaaaaaac taagcgcttg agcgcaggcg acccgtgcgt 6120 

aactggcgac tgcatcatcg gcccgafcggt ctcgccaaat tcgggtgagc ggatcaatgg 6180 

tttgttcaaa gacgcgatcg acaaaggggc aaaagttgtt tgcggcggct tggcccaagg 6240 

Stgcgctcatg ccggccacga tcctggatca cgtcaaatct gacatgcgga ttfcacgatga 6300 

ggagaccttt ggtcccatca ccgtggtaat ccgttgtaaa ggcgaagcag aggccgtccg 6360 

cattgccaac gacagcgtct atggcctgtc gtcgggcgta tttgggcgcg acatcaaccg 6420 

cgctctacgc gtgggtatgt ccatcgaata tggttctgta cacatcaacg gttcgaccgt 6480 

ccagaacgag gcgcaggctc cttacggagg caccaagaac accggctacg ggcgcttcga 6540 

lOcggccgtgct gtaatcgacg agttcacaga gatcaagtgg ctgaccatcg aacctttcga 6600 

gcagcaatat cccttctgat aagcactaac tcccaggaat caaactatga gtaagcaagc 6660 

tgcagttatc gagctcggat acatgggtat ctcggtcaag gaccctgatg cgtggaaatc 672 0 

atttgccacg gatatgctag gtctgcaagt tcttgatgag ggtgagaagg accgtttcta 6780 

tctgcggatg gattactggc atcatcggat cgtagtccat cacaacggac aggacgactt 6840 

15ggagtaccta ggctggcgtg tagccggcaa gccggagttc gaagctctgg gtcaaaagct 6900 

tattgatgcc ggttacaaga tccgcatctg cgacaaagtt gaggctcagg agcgtatggt 6960 

gttgggtctg atgaagacag aagatccggg cggcaacccg accgagatat tctggggccc 7020 

ccggatcgac atgagcaacc cgttccatcc cggtcgcccc ctgcacggaa agtttgtgac 7080 

cggtgaccaa ggcttgggcc attgcatcgt tcgccaaacc gacgtcgcag aagctcataa 7140 

20gttttatagc ctgctgggct tccgtgggga cgtcgaatac cggattccgt fcgcccaacgg 7200 

catgactgcc gaactgtcgt tcatgcattg caacgcccgt gatcactcca ttgcttttgg 7260 

tgccatgccc gctgccaaac gactcaatca cttgatgctt gagtacaccc atatggaaga 7320 

cttgggatac acgcaccaac agtttgtaaa gaacgaaatt gacattgcct tgcagcttgg 7380 

cattcacgcc aacgacaagg cgttgacgtt ctatggtgca acgccttcgg gctggctcat 7440 

25tgagcccggc tggcgaggtg ccacggccat agatgaagcg gagtattacg tcggcgacat 7500 

cttcggccat ggcgtggagg ccactggata tggcctggat gtaaaactga gctaaagatg 7560 

cgcgctcgtt gggcgaggct ctagtccagc atcttcatac gcaaccaacc ttgcagggcg 7620 

atgagatcaa aggacgttaa agcgaagggg aagtggttcg ggccatgcgc ataccgatcc 7680 

atgacatttg tttcatagta tataggtaga taggtgaatc aagcgcttag tcaactagtg 7740 

30gacacatctg ttccatgagg ctatctacta tctattcaaa acaagaatga taaataggat 7800 

gaaaataata atgataaaaa gaacgatttg tcttgtgtat cctctattct gtttggcaag 7860 

ccccacatgg gccgaagagt cgccttggac gtaccgtatt ggtatgacta atgtagcttt 7920 

cgatgctagc gcaaaagtat acttaaatgg tcagcgggtg ccaggaggaa gcgctgatgc 7980 

gagcgataac aacgcgctta cattcgactt cggctacgcc atcaacgacc agtggaatgt 8040 

35acgtgcgatt gtcggtattc cgcctacaac taaagtgacg ggcgcaggca cacttcctgg 8100 

tatccagctg gggaaaataa cttacgctcc aacagtatta acgttgaact ataacctccc 8160 

cgctttgggt cccgttcgcc ctcacatagg tgcgggagtc aactacacgc ggatttttga 8220 

aagtcgggac gctaatctaa aatcgttcga tgccgaccac gcttggtccc ccgcgctaca 8200 

tgttggtgcc gatattgacg ttaaccgtgg ttggttcgtt agcattgata tccggaagtt 8340 

40atacctgaaa accgacgcat cagggtactt ggggccacag gaggctaaag cacgggtaac 8400 
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tcttgaccca ttactaactt cgatcgcgat cggacgccaa ttctgatgat tctgtttaaa 8460 

gttctttatc tatctaaccg caaagggtgt ttccatgtcg aataaaatta tgaaaacgtc 8520 

gcgtcttacc gccgaagata tcaacggcgc ctggactata atgcccacac cctcgacgcc 8580 

tgatgcttct gattggcgca gcactgccac cgtggactta gaagagactg cccgcatagt 8640 

5tgaagagctg attgcagctg gtgtcaacgg tattctaagt atgggtactt ttggtgagtg 8700 

cgccacgttg acctgggatg aaaaacgtga ttatgtctcg acgattgtcg agaccattcg 8760 

tggtcgcgtg ccttatttct gtggcacgac agccttaaat acccgagaag tcatccgcca 882 0 

gacccgagag cttatcgata ttggcgccaa cggcactatg ctcggggtgc cgatgtgggt 8880 

gaagatggac ctgcctacag cggttcagtt ctatcgtgat gttgcagatg cggtaccaga 8 940 

lOggctgccatt gcgatttacg ccaaccccga agcattcaag ttcgacttcc ctcgcccatt 9000 

ctgggcagag atgtccaaaa ttccgcaggt agtgactgcg aagtatctag gcatcggaat 9060 

gcttgacttg gacctgagac tggcacccaa catccgcttc cttccccacg aagatgacta 9120 

ttacgcggcc gcacgcatca atcccgagcg cataaccgcg ttctggtcaa gcggggccat 9180 

gtgcggcccg gctaccgcca tcatgttgcg tgacgaagtg gtgcgggcca agagcaccgg 9240 

15tgactgggcc aaggccaaag ccatctccga tgatatgcgt gcagccgact cgacattgtt 9300 

tccgcgtggc gacttttcgg agttctcgaa gtataatatc gggcttgaaa aggcacggat 9360 

ggatgcggct ggttggctca aggctgggcc ctgccgtccg ccctacaacc ttgttccaga 9420 

agactacctc gctggtgcac agaaatcagg caaggcttgg gccgcgctgc acgctaaata 9480 

cagtaatgaa ttgaagtagt tcacctccgc agacctgagt gacagggtgg cgcagacgct 9540 

20gagggtgcag gaattaagtg agctaaagca catttcttgc gccaggcatt gccagatcag 9600 

caaagtttgc tgatctggca gtttcaaaaa tttgggcgaa agctgatatc aggaatacgg 9660 

gataaaggca gtgcaccata acgacggggg cgtgccattc gtgatgaacg attttgctat 9720 

tgtgccgact tctgttcttg gagtgtttga ttgtgattgt cgatttttat ttcgattttt 9780 

tgagtccgtt ctcttacttg gccaaccatc gtttgtcaaa gcttgcgcaa gactatggct 9840 

25tttccattcg ttattacgca atcgatttgg cgcgagttaa aatagccatc ggaaacgttg 9900 

gtccatctaa tcgcgacctg atagtcaagc tggactattt gaaagtagat ttgcaacggt 9960 

gggccgagct ttacgaaata ccgttggtat tcccagctaa ctacaacagc cgacggatga 10020 

atactgggct ttattactcg ggagccatgg cacagactgg tgcctatgtg aatgtagtat 10080 

ttaatgcggt ttggggagat ggcatagctc cagatttgga aagcttgcct gctctggtat 10140 

30ctgaaaaact aggctgggat cgtagcgcct tcgaggactt tatcagcagc gatgccgcaa 10200 

cagagaggta tgacgagcag acacatgccg cgatcgaacg caaagtgttc ggtgtgccaa 10260 

cgatgttttt gggcgatgaa atgtggtggg gaaacgaccg tctatttatg ctcgagaacg 1032 0 

cagtgggagg tgcgcctgta aatggagaat agtcgctacg gagcgcttgt gccggctaaa 10380 

tgccgatata agtggttgac ctgatcgtta tttgctcgat acagcgcttt caaaatcagc 10440 

35ggctactgaa gtcagataaa aatgcgggac tacttcaggc atcctgtgcg acacaaagtt 10500 

ttacctgtaa ttgtccacct attccgagtt tggaatggta gctgactcgc tatgcgacca 10560 

gcgatagcct aacaagacat gcatcactgg taacggtggg gtgtgaagct cctgcaacaa 10620 

tgtagcccct tgatgtgtgt atttgctgcg aggtgaagca cagatgctcg gagccgtacc 10680 

ggcttgtggc gctaggctgg caagtatgag caacgtaagt gggggttggg gcgcaatggg 10740 

40aaccaaaaac caacgcaagc cttaccagcg tcgttcggtg ccttcctccc atgcctccgc 10800 
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ctcgataaag cagctgcgca tatcggcttc ctggctgatc tcggttagta ggtcatgcaa 10S60 

ggtcttgtcc agcgcctcgt cgctccgata cggaatggtc agctcgtaat ggccggtctc 10920 

cgaccgcttc atgccgtagg gctccaggca gtagcgctcg atgttctccg tggcccgctt 10980 

ccgaccgcgc atgaacttgc tgttgttcac caccgccagg cgcagggtga cggtggccac 11040 

Sccgctcgacg gttgactctg ccggtgacgc gatattgcgc ttttgacctc gcgccagggc 11100 

gctcttctgg tacgtcccga tctcgacgcc acggtggcgt aggtagctgt acagggtgct 11160 

cttggagatg tgcaacttct cgccgatggc gctgacgctc aggcggccct cgcggtacag 11220 

ggtctccgcc gccatggcgg tggcctcggc cttggctggc aggcccttgg gacggcgacc 11280 

gatccggcct cgagtccgtg ccgccgacag gcccgcctga gtccgctcgc ggatcagctc 11340 

lOgcgctcgaac tgaacaggtt gaacaccagg cgatcttggg cgtgggtgct gtcaatgggg 11400 

tcgttcaggc tctgcaagcc gactttgcgt gcagccagct agccgaccaa ctcaaccagg 11460 

tgcttgagcg agcgaccgag gcgatccagc ttccagatca ccacggcatc gcccgctcga 11520 

acatgggcta gcaacttgtc caactccggc cgcgcgcttt ttgcgccgct ggcgatgtct 11580 

tgatagatgc gttcgcaccc ggcctgtttc agggcatcga cctggaggtc ggcgttctaa 11640 

tStcccgagtgc tcacccgcgt ataaccgatc ttcataaaaa gtaccgttta cttgactgca 11700 

ttagtaatag ttgaactttg attaagctta ccagttattt gaaccctagc gcaggtgtaa 11760 

gcgtccagcc gccccacctc tactcagctt gatgaaccga ggggcagtag ttcatcaatc 11820 

cggctgttag ggaaactctg aagaagactt cctgattttg gcaaaatgcc cggatttcac 11880 

ccgccgagtt ttccaatgaa gcagatgacc ttcctcgacg ccgagtatgc cggtaagcgc 11940 

20aaacagaccc gcaaagagct gttcctgatc gagatggatc gggtggtgcc gtggaagggt 12000 

ttgattgcct tgatcgatcc gcattacccc aagggtgagg gtggcccgcc agcctatcca 12060 

ctgacggcaa tgctgcgggt tcatctgatg cataagtggt tcggctacag cgacccggcg 12120 

atggaggaag cgctgtacga gacgaccatc ctgcgccagt ttgccgggtt gagcctggag 12180 

cgcattccgg actaaaccac catccccaac ttccgccggc tgctggagaa acacgaacta 12240 

25cctaccggca tcatgtttgc tgaattttcg cctttggctt tgagtgcggc attcaaccga 12300 

tacaggtgtg gcatgacacg ccaaagaaat caaccttggc ggctagaaca cttgtcgatg 12360 

aagcgaacaa tccgcaaacg ttaaggctac caggaaaaac ccaggatgcg gtagccgata 12420 

gcccgcgcgc gcattggtct ccttccgtcc aagagtgtca aaggatattg gactaatgca 12480 

gcatgcctcg accacatggc cgtggtccgt tcatctgttg gccatacgtc tatcgaatcg 12540 

30gacgcgggcg gcgacgacct cgggttgttg gcgttcggcc cagtcgatga gctcagtcag 12600 

ggacggcatc agcgaagtcc cgagcggggt gagcccgtag cgcacagatg ggggtgtcga 12660 

gggcgttacc tcgcgccaga tgagtccgtc gcgttcgagg tgacgcagcg tctcggtcag 12720 

catccggcgc gaaatgtcgg gcacggcgcg ggccagcgca ttgaatcgtt gcggaccctg 12780 

cgacaaggtg accaggatca gcgtcgac 12808 

35 

<210> 11 
<211> 15024 
40 <212> DHA 
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<213> Artificial Sequence 

<220> 

<223> A modified DNA sequence 

5 



<400> 11 



atcgtcctta 


agcgccgcac 


gcagccgttt 


aagtgaattg 


ctgacggcag 


gctgcgtcag 


60 


ccccagtttt 


tcgccggccg tcgatacgct 


ccggtcgagc 


agtagctggt 


tgaagaccac 


120 


cagcagattc 


aagtcgatgt 


cgcgcagatc 


catgatgcct 


caccattatt 


cafcgctggtg 


180 


lOattttaacta 


tcagacttga 


tctatagcgc 


tataccgatc 


gacgcgccag 


aatcgcagcc 


240 


attcggagac 


aactgaaaaa 


agagcttgca 


tggaactggt 


agtagaaccc 


ctcaatfctgc 


300 


atctgaacgc 


ggagaccggc 


agcaccctgc 


ttgacgtgct 


caggtccaac 


gaggtcccca 


360 


tttcttatag 


ctgcatgtcg 


ggccgctgcg 


gcacttgccg 


ttgccgtgtg 


attgccggcc 


420 


atcttcgcga 


taacggcccc 


gagacagggc 


gcccgcaggc 


aggaaaggga 


acctacgttc 


480 


15tggcctgtca 


9gcggttctg 


accgaagact 


gcacgatcga 


gattcctgaa 


tctgacgaga 


540 


tcgtggttca 


cccggcgcgc 


atcgtcaagg 


ggacggtcac 


agcgatagac 


gaagccaccc 


600 


atgacatccg 


gcgcctgcgc 


atcaaactgg 


ccaaaccgct 


tgagttcagc 


cctggccagt 


660 


acgcaacggt 


gcagttcacg 


cccgaatgcg 


tccgcccata 


ttcgatggcc 


gggctgccta 


720 


gcgatgcgga 


aatggagttt 


cagattcgcg 


cggttccggg 


cgggcatgtc 


agcaactacg 


780 


20ttttcaatga 


actgtccgta ggcgcttcgg 


tgcggatcag 


cggccccctc 


ggaacggctt 


840 


atctgcggcg 


cacgcacacc ggccccatgc 


tttgtgtggg 


gggtggaaca 


ggtctggcgc 


900 


ccgtcctttc 


gatcgttcga 


ggcgcactgg 


aaagcgggat 


gagcaacccc 


atccatctgt 


960 


acttcggtgt 


gcggagcgag 


caggacatct 


atgacgagga 


acgccttcac 


gcattggctg 


1020 


caaggtttcc 


gaatctcaag gtgaatgtcg 


ttgttgcaac 


aggccctgcc 


ggccctggtc 


1080 


25gtcgatccgg 


cctggtcacc gatctgatcg 


gccgtgactt 


gcccaatttg 


gcgggatggc 


1140 


gcgcctacct 


gtgtggcgct ccggccatgg 


tcgaggccct 


gaacctgctc 


gttgctcgcc 


1200 


taggcatagt 


acccgggcac 


atccatgccg 


atgcgttcta 


tcccagcggc 


gtctgagcga 


1260 


aggcaccatg 


cgaacccaat 


tcaacccaag 


gataccaagc 


catgagtgaa 


ccccaacgat 


1320 


taaaacccgt 


gtttccccaa gatccgaaat 


ggccgggcga 


aggtagcagc 


cgcgttccct 


1380 


30tctgggccta 


cacccgcgaa gacctgtaca 


agcgcgaatt 


993gcgcctg 


ttctatgcaa 


1440 


accactggtg 


ctatgtaggc 


ctggaagccg 


agattccgaa 


tccaggcgac 


ttcaagcgaa 


1500 


cggtgatcgg 


tgagcgctcg gtcatcatgg 


tgcgtgatcc 


ggatggcggc 


atcaacgtgg 


1560 


tggagaacgt 


ctgcgcccac 


cgtggcatgc 


gcttttgccg 


cgagcgccac 


ggcaacgcca 


1620 


aggacttctt 


ctgcccctac 


caccagtgga 


actacagcct 


caagggtgac 


ctgcagggcg 


1680 


35tgcccttccg 


ccgtggcgtc 


aagcaggacg 


gcaaggtcaa 


cggcggcatg 


cccaaggact 


1740 


tcaaactcga 


agaacacggc 


ctgaccaagc 


tcaaggtggc 


cgcccgaggc 


ggtgcagtgt 


1800 


ttgcctcttt 


tgaccacgat gtcgagccct 


tcgaggagtt 


cctgggccca 


accatcctgc 


1860 


attacttcga 


ccgcgtcttc 


aacggccgca 


agctcaagat 


cctgggctac 


cgccgccagc 


1920 


gcatcccggg 


caactggaag ctgatgcagg 


agaacatcaa 


ggacccctac 


cacccgggcc 


1980 


40tgctgcacac 


ctggttctcg 


acctfccgggc 


tctggcgcgc 


cgacaacaag 


tcggaactga 


2040 



32 



agatggacgc caagttccgc 


cacgccgcaa 


tgatctccac 


gcgcggtcag ggcggcaaga 


2100 


acgaggaggt cgtgtccggc 


gtggacagct 


tcaaggaaca 


gatgaaggtg 


aacgacccgc 


2160 


gcctgctcga catcgtgccc 


gagccctggt 


ggggcggtcc 


gactgcggtg 


atgaccacga 


2220 


tcttccccag cgtgatcatc 


cagcagcagg 


tcaacagcgt 


atcgacccgc 


cacatccagc 


2280 


5ccaacggtca cggctccttc 


gatttcgtct 


ggacccactt 


cggcttcgag gacgacaacg 


2340 


aggagtggac ccagcgccgc 


ctgatccagg 


ccaacctgtt 


cgggccggcg 


ggcttcgtgt 


2400 


cggccgatga cggcgaggtg 


atcgagtggt 


cgcaggaagg 


ctttgagcaa 


aaaccgacgc 


2460 


accgcaccgt gatcgagatg 


ggcggtcacg 


aaatcggcga 


cacggaccac 


atggtcaccg 


2520 


agacgctgat ccgcggcatg 


tacgactact 


ggcgcaaggt 


gatgggggaa 


taaacatggt 


2580 


lOcgacttcaaa acctatttcg 


aactgctgaa 


cctgtacagc 


gactacgcca 


tggtgtgcga 


2640 


ctccgccaat tgggagaagt 


ggcctgattt 


cttcatcgag 


accggcacct 


accgcctgca 


2700 


gccgcgcgag aactttgagc 


agggcctgcc 


gctgtgcctg 


ctggcgctgg 


agagcaaggc 


2760 


aatgatccgc gaccgggtgt 


acggcgtcaa 


ggaaaccatg 


taccacgacc 


cctactacca 


2820 


gcgccacatc gtgggcacgc 


cgcgcgtgct 


gtcagtggag 


cgtgacgcgg 


atggcgagcg 


2880 


15catcaccgcc gaagccagct 


atgccgtgat 


tcgcaccaag 


tacgacggcg attccacgat 


2940 


tttcaacgcc ggctattacc 


gagacgtgat 


cgtgcgcacg 


cccgagggcc 


tcaagctgaa 


3000 


gtcgcgcctg tgcgtttacg 


acagcgaaat 


gatccccaac 


tctgtgatct 


atccaatctg 


3060 


aggcgacgca catgactcag 


aactggattg 


atgcagcttg 


tcttgacgac 


atccctgaag 


3120 


gcgatgtggt cggcgtcaaa 


gttaacggca 


aggaaattgc 


gctctacgag gtcgaggggg 


3180 


20agatttatgc caccgataac 


ctgtgcacgc 


acggcgctgc 


gcgcatgagc 


gatggctttc 


3240 


tcgaaggccg ggaaattgaa 


tgccctctgc 


accaaggtcg 


atttgatgtc 


tgcacaggca 


3300 


aagccttgtg cacccccctg 


acaaaggaca 


tcaaaaccta 


ccccgtcaaa 


attgaaaaca 


3360 


tgcgcgtgat gctcaaaatg 


gagtaagact 


gctttcaatc 


aggaggctga 


attcggacca 


3420 


atcaaccaaa ccatccaatc 


cacccattag ccaaaaaacg 


gagacattaa 


tcatgattta 


3480 


25tgaaaatttg gtgagtgaag 


cagggctgac 


gcaaaagcac 


ctgattcatg gcgacaaaga 


3540 


acttttccag cacgaattga 


agaccatctt 


cgcgcgaaac 


tggctttttc* tgacccatga 


3600 


cagcttgatt ccctcccccg 


gcgactatgt 


cacagccaaa 


atgggtgtcg 


atgaagtcat 


3660 


cgtctcccgc cagaacgatg 


gctcggtgcg 


agcctttttg 


aatgtttgcc gtcaccgggg 


3720 


caagacacta gttcacgctg 


aagccggaaa 


tgcgaaaggc 


tttgtgtgca 


gttaccacgg 


3780 


30ctggggcttt ggctccaacg 


gcgaactgca 


aagcgttccc 


tttgaaaaag 


agttgtacgg 


3840 


cgatacgatc aaaaagaagt 


gcctgggctt 


gaaagaagtc 


ccccgcatcg aaagctt fcca 


3900 


tggcttcatt tacggttgtt 


ttgatgcaga 


ggcccccacg 


cttgtcgatt 


atctgggtga 


3960 


tgcagcctgg tacttagaac 


ccatcttcaa 


gcactctggt 


ggcctggaac 


ttgtaggccc 


4020 


ccccggcaaa gtggtgatca 


aggccaactg gaaggctcct 


gcggaaaact 


ttgtgggtga 


4080 


35cgcgtaccac gttggttgga 


cgcacgcatc gtctttgcgc 


tcaggtcagt 


cgatatttac 


4140 


ccctcttgcg ggcaacgcta 


tgcttccacc 


cgaaggcgcg 


ggcttacaaa 


tgaccagcaa 


4200 


gtatggcagt ggaatgggcg 


tattgtggga 


cggctactcc 


ggtgtccaca gtgctgacct 


4260 


ggttcccgaa atgatggcat 


tcggcggcgc 


aaaacaggaa 


aaactcgcca 


aggaaatcgg 


4320 


cgatgtccgg gcacggattt 


accgcagcca 


tctaaactgc 


acggttttcc 


cgaacaacag 


4380 


40cattttgacc tgctccggtg 


tcttcaaggt 


ctggaacccg 


atcgatgaaa 


acacgaccga 


4440 
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ggtfctggacg 


tatgccatcg 


tagaaaaaga 


catgcctgag 


gacttaaagc gtcgcttggc 


4500 


tgacgcggtt 


cagcgcfactg 


tcggaccagc 


aggattctgg 


gaaagcgacg 


acaacgacaa 


4560 


catggagacg gagtcgcaaa 


atgccaagaa 


ataccaatcc 


agcaacagtg 


atctgattgc 


4620 


caatttgggt 


ttcggcaagg 


acgtctacgg 


cgacgaatgc 


tatccgggcg 


tcgttgccaa 


4680 


5atcggcaatc ggcgaaacca gctatcgcgg 


attctaccgt 


gcctaccagg 


ctcacatcag 


4740 


cagctccaat 


tgggccgagt 


tcgaaaacac 


ctcccgaaat 


tggcacaccg 


aactcaccaa 


4800 


gacgactgat 


cgctaatcca 


ggagccaacc 


atgatgatca 


atacccagga 


agacaagctg 


4860 


gtctccgcgc 


acgacgccga 


agaatttcat 


cgtttcttcg 


tcgggcacga 


cagcgatctg 


4920 


cagcaagaag 


tcaccacact 


cctgacccga 


gaagcacatc 


tgttggacat 


tcaggcctac 


4980 


lOaacgcctggc 


ttgaacactg cgttgccccc 


gagatcaaat 


accaagtgat 


ctcgcgagaa 


5040 


tttcggtcca 


cttccgagcg 


tcgataccaa 


ctgaatgatg 


cggtgaacat 


ctacaacgag 


5100 


aactatcagc 


acctgaaagt 


tcgagttgag 


catcagatgg 


acccgcagaa 


ctgggccaac 


5160 


agcccgaaga 


tccgtttcac 


ccgcttcgtc 


accaatgtca 


cagcggccaa ggacaagatc 


5220 


gtaccggatc 


tgctgcatgt 


gcgttccaac 


ctcattctcc 


accgcgccag 


acgcggcaat 


5280 


IScaagttgacg tcttctatgc 


aacgcgggaa 


gacaaatgga 


aacgcatcga 


aggtggtggc 


5340 


atccaactgg 


tggaacgtct 


tgtggactac 


ccggagcgca 


ttctccagac 


ccacaatctg 


5400 


atgaccttcc 


tgtgaaccct 


ggggatgcct 


gccttgatgg 


cggtcatcct 


tgattgtttt 


5460 


aaacagaaat 


ttattgccat 


gaacatacag 


caagttattg 


ccattactgg 


cgccggttca 


5520 


ggcatcgggc 


tggaactggt 


tcgatccttc 


aaagcagctg 


gctattgcgt 


gtccgcactt 


5580 


20gttcgcaacg aggaacaaga ggcgggcctt 


cgcagtgaat 


tcaaagatgc 


cattgagatc 


5640 


gtagcgggcg 


atgtttgtga 


tcacgccacc 


aatgaaaagc 


tggtcaacaa 


ggcggtcgcc 


5700 


aggttcggac 


acctcgactg 


cttcatcgga 


aatgccggga 


tatgggatta 


catgctgggc 


5760 


gtcgacgagc 


catgggagaa attatccggc 


agtttcgagg 


agatatttga 


tatcaacgtc 


5820 


aaaagctatt 


tcagcggcat 


cagcgcggcc 


ttgccggaac 


tcaaaaaaac 


gaacggatcg 


5880 


25gtc9tagtga 


cggcttccgt 


ttcttcctat 


gcggccggcg 


gcggcggctc 


ttgctacatt 


5940 


gccagcaagc 


atgcagtgct 


gggtatggtc 


aaggcgttgg 


cctacgaatt 


ggccccgcac 


6000 


atccgggtca 


atggcgttgc 


gccaggtggt 


acggtcactt 


ctttggctgg gccggcaagc 


6060 


gccggtttcg 


acaaaaccaa 


aatgaaagac 


atgcccggca 


tcgatgacat 


gatcaagggc 


6120 


ctgaccccct 


tggggttcgc 


agcaaggccc 


gaggacgtgg 


tggcaccgta 


tcttttgctg 


6180 


SOgcctcccggg 


aacaagggaa 


gttcatcact 


ggcaccgtaa 


tcggcattga 


tggcggcatg 


6240 


gcgctcggtc gaaagtgaat 


tttcaatcaa 


atcagatttt 


tcaaccccat 


tcccaggaga 


6300 


caacccatga 


agacgaaatt 


gttcatcaac 


aacacctgga 


gcgcttcgag 


tgacaaaaag 


6360 


tcattcgatc gcaagcaccc 


tgtcagtggc 


gaggtcgtga 


cccaatgcgc gaacgccacg 


6420 


gtggacgatg 


cggtcaatgc 


ggctcgagcc 


gctcaagagg 


cgttcaagtc 


ctggaaggcc 


6480 


35gtcggaccct 


cggagcggcg 


gcgccttctt 


ttgaaggtgg 


cagacgtcat 


ggagagcaaa 


6540 


acgcccgagt 


tcatcgaagt 


gatggccaag 


gaagtgggag 


cctccgcgct 


gtgggcgggg 


6600 


ttcaacgtgc 


acctgtcggc 


caatgtattc 


cgggaagccg 


cctcactggc 


cacccaaatt 


6660 


caaggcgaaa ccattccgac ggacaagcct 


gacaccctgt 


caatgacgct 


gcgtcacgct 


6720 


gtcggcccca tcttgagcat 


cgttccctgg 


aacggcaccg 


ccgtgctcgc ggcgcgggcc 


6780 


40atcgcttatc cgctggtctg cggcaatacc 


gttgtgttca 


aaggctccga gttcagcccc 


6840 
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ggtacgcacg ctttgatcac 


caagtgcctg caggaggccg acctgcctgc 


tggcgtgctc 


6900 


aactatctga actcctcccc 


ggaccggtcg 


cccgatattg 


cagatgcgct 


gatttcgtct 


6960 


aaagagattc gtcgcatcaa 


cttcacaggc 


tccactcgcg 


tggggcgcat 


catcgcccag 


7020 


aaatcggccc agcatctcaa 


gcgctgcttg 


ctggagttgg 


gtggaaagtc 


cccgctgatc 


7080 


5gttctggacg acgctgacat 


cgacgcggca 


gtcaaggccg 


cggtgttcgg 


cagcttcctg 


7140 


ttccaaggcc agatctgcat 


gtccaccgaa 


cgcctggtgg 


tcgacgaaaa 


gatcgcggac 


7200 


gaatttgtcg cgaagttcgt 


cgagaaaacc 


aagcagttga 


gcgcaggcga 


tccatgcgtc 


7260 


acaggggact gcatcatcgg 


cccgatggtg 


tcgcccaact 


cgggtgaccg 


aatcaatggt 


7320 


ctgttcaaag atgccatcag 


caagggcgcc 


aaggtcgtgt 


gcggcggcat 


cgccgagggt 


7380 


lOgcggtcatgc ccgccacgat 


cttggaccac 


gtgacagccg 


acatgcagat 


ctacgatgag 


7440 


gaaaccttcg gtcccatcac 


tgtggttatc 


cggtgcaaga 


gcgaagcgga 


cgccatccgc 


7500 


attgccaatg acagcgccta 


cggcctgtca 


tcgggcgtgt 


ttggccggga 


cgtgaaccgg 


7560 


gctctgcgcg tgggcatggc 


gatcgaatac 


ggctcggtcc 


atatcaacgg 


ctccaccgta 


7620 


cagaacgagg ctcaggcgcc 


ttatggcgga 


acaaaggcca 


ccggttatgg 


ccgcttcgac 


7680 


15ggacgcgcgg tgatcgacga 


gttcacggaa 


ctcaagtggc 


tgaccattga 


accattcgag 


7740 


cagcagtatc ccttctaagc 


tgaagcaaca 


aaggagttaa 


accatgaaca 


agccagcaac 


7800 


tgtcattgaa ttggggtaca 


tgggcatttc 


ggtcaaggat 


cccgcagcgt 


ggaaatcctt 


7860 


tgccgcaaac atgctgggac 


ttcaagtcct 


cgatgagggt 


gacaaggatc 


gcttctatct 


7920 


gcgaatggac aattggcacc 


atcggatcgt 


ggttcatcac 


aacggtcaag 


atgaccttga 


7980 


20atacctgggc tggcgtgtcg 


ccggtcaacc 


ggaattcgag 


gcafctgggtc 


aaaagctcgt 


8040 


ggacgcaggc tacaaagtcc 


gcgtgtgcga 


caaagccgaa 


gcacaagaac 


ggatggtgct 


8100 


gggcctgatg aagacagaag 


atccgggggg 


caacccgacc 


gagattttct 


ggggaccccg 


8160 


gattgacctg aacaacccct 


tccatcccgg 


tcgtcccttg 


cacgggaaat 


ttctaaccgg 


8220 


tgatcagggc ctgggccact 


gcatcgtgcg 


tcagaacgat 


gttgaagcgg 


cacgtaagtt 


8280 


25ctatagcttg ctgggatttc 


gtggagatgt 


cgagtaccgc 


cttcctttgc 


ccaacggcat 


8340 


gacggctgag ttgacgttca 


tgcattgcaa 


tgctcgcgat 


cattcoatcg 


ctttcggtgc 


8400 


aatgcctgcg gccaagcgcc 


tcaatcatct 


gatgattgaa 


tacactcata 


tcgaagattt 


8460 


gggttgcaca caccagcttt 


tcacgaagga aaagattgac 


attgccttgc 


aattgggcat 


8520 


ccattccaac gataaggcgc 


tgacgttcta 


cggggcaaca 


ccttccggct 


ggctgataga 


8580 


30acctgggtgg cgaggcgccc 


ccgccattgc 


tgaatcggaa 


tattacgtcg 


gcgacatttt 


B640 


cggccacacc atcgaggcca 


ccggttatgg 


attggacgtc 


aaactgagct 


agccatgtaa 


8700 


cagatgcgaa atcgatcgca 


tctgttttcc 


ccattcagtt 


ccatataaaa 


aaaaggagac 


8760 


aatgatgatc aaagaagcca 


tttccctcgc 


aggacttggg 


atgctgatgc 


tcagcaccgc 


8820 


atatgccgag gattcccgtt 


ggtcatatcg 


catcggcgcc 


accaacgtag 


ccttcgatgc 


8880 


35gagcgccaaa gtttcgattg 


acggaacaag ggtgccgggt 


ggaagcgctg 


acgccagtga 


8940 


caacaacgct ttgacatttg 


acttcggtta catcattaac gataactgga 


acgcgcgatt 


9000 


aattgttggc attccaccca 


ccacaaaagt 


gacgggcgca 


ggcacgctgc 


ctccgatttt 


9060 


gctgggccgt gtccaatatg 


ctcctgcagt 


tttgtcggcg 


acctacaacc 


tgccacagat 


9120 


gggattggtt cgcccgtacg 


tgggggcggg gatcaactac actcgaattc 


tgaaaagcaa 


9180 


40agatgccaat ctgacctcat 


tcgatgcaga 


tcatgcgtgg gcacttgtgc 


tccacatcgg 


9240 
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tgcagaagcg aacatcaacc 


gcgactggtt 


cgtcagcttt 


gatatccgaa 


aactttatct 


9300 


aaaaacagat gcatcggggt 


ttcttgggcc 


tcaagttgct 


acagcccgcg 


taacgttgaa 


9360 


tccgctgctg acgtcgattg 


cgattggccg 


gagattctga 


tcggtccaca 


ttgattccaa 


9420 


aattctgttt gcatcaccat 


tttcaaagga 


aatttgaatg 


acaagaaaga 


cgagcaaagc 


9480 


Sggtgcgcctg accgccgcgg 


atattcaagg 


cgcatgggtc 


atcatgccga 


ccccgtccac 


9540 


gccggatgcc tcggactggc 


gcagcacgca 


cacggtcgat 


ctcgacgaga 


cggcccggat 


9600 


tgtcgaggag ttgattgcgg 


ccggcgtcaa 


tggcattctg 


agccacggca 


cctttggcga 


9660 


atgcgcgacg ctgacgtggg 


a99a-9aagcg 


ggattttgtt 


tcaacggtcg 


tggaaaccgc 


9720 


gcgcggtcga gtgccctact 


tctgcggcac 


aacggccttg 


aatacccgtg 


aagtcatacg 


9780 


lOccagacccgc gaattgatcg 


acattggcgc 


ccaaggaaca 


atgctcggcg 


tgccgatgtg 


9840 


ggtgaagatg gatctgccta 


ctgccgtgca 


attttatcgc 


gatgtggcgg 


aagcagtgcc 


9900 


agatgcagcc atcgctgtct 


acgccaaccc 


ggaggctttc 


aaatttgatt 


ttcctcgccc 


9960 


gttttgggcc gaaatgtcca 


aaatcccgca 


ggttgtcaca 


gccaagtact 


tgggcatcgg 


10020 


gatgctggac ttggatctga 


aattggcccc 


aaatattcgc 


ttccttccgc 


atgaggatga 


10080 


ISctactacgct gcggcccgga 


tcaatcccga 


gcgcatgact 


gctttctggt 


ctagcggttc 


10140 


catgtgcggc ccagcgaccg 


cccttgtgct 


gcgcgatgag 


gtggtaaagg 


ccaaaaatac 


10200 


a-^gtgattgg gccaaggcca 


aggctatttc 


agatgacatg 


cgcgcagccg 


atgccacact 


10260 


gtttccacgc ggcgatttct 


cggaattctc 


aaaatacaac 


attggcctcg 


aaaaagcacg 


10320 


aatggacgag gccggctggc 


tcaaggcggg 


gccgtgccgg 


ccaccctata 


cgctggttcc 


10380 


20cgacgaatac cttgcaggtg 


cccgaaaatc 


aggcaaggcc 


tgggccgcac 


tgcataccaa 


10440 


gtatgccaag gaattgagga 


aaaccaaaac 


ggcaaccaac 


tcgaaaaaga 


agtaagtcca 


10500 


ggccctgagt cagacatctc 


cgatcagcac 


aacctgctga 


tctggaggtt 


ttctggatta 


10560 


gtgcagtcgg cgagttaaaa 


atatgccagt 


acagagtgga 


agcgccacaa 


gcgcagggtg 


10620 


caccgatcgc cctgaagcaa 


tctcacacta 


ttgccatctt 


ctctttgttg 


ggagtgcatg 


10680 


25atcgtgatgg tcgattttta 


tttcgatttt 


ttgagcccat 


tttcgtatct 


ggccaaccac 


10740 


cgtttgtcgg tgctcgccgg 


gcgttatgga 


ttctccatcc 


agtatcacgc 


cattgatttg 


10800 


gcgcgagcaa aaacggccat 


tggcaacatc 


gggccatcca 


atcgggacct 


caaggtcaag 


10860 


cttgactact taaaggtgga 


tttgcagcga 


tgggccgatc 


tctataggat 


tccgttggtt 


10920 


ttccccccta acttcaacag 


ccgccgggtg 


aatgccggac 


tgtattaccc 


ggcagccagg 


10980 


30gagcgagccg ctgaatatgt 


tcgccttgtt 


ttcgattcgg 


cttgggggaa 


agggtgggca 


11040 


ctggatgctg atagcttgct 


ggctgaggta 


tgcgacaagc 


taaactggga 


tctcggtgaa 


11100 


tttgaagatt ttttgaacag 


cgaaaatgcc 


gccaaggcat 


acgacgaaga 


gacgcaggcg 


11160 


gccattgacc gaaaggtttt 


cggggttccc 


accgtgtttt 


gggatgatca 


aatgtggtgg 


11220 


ggaaatgacc gccttttcat 


gcttgagagc 


aggttgcaaa 


aggaaacgca 


accataaatt 


11280 


35ctcagtgcat cgatttttct 


gatttttcaa 


ccacgccttg 


attttaagga 


gtctcatgaa 


11340 


gctttattac agccccggcg 


catgttcatc 


gtcgcctcac 


atcatccttc 


gtgaaggtgg 


11400 


atttgacttt cagctagaaa 


aggttgatct 


cggcaccaag 


gtgactgaga 


ctggtgttga 


11460 


ttacaagacc gtcaatcccg 


tgggcagtgt 


ccccgctttg 


caaatggatg 


atgggcaggt 


11520 


gctcaccgaa gggccggcca 


tcgttcaata 


cctcgctgac 


cgtgtgccag 


aaaaatgcct 


11580 


40ggcaccggca gctggctcgc 


tggagcgtta 


ccgactgatg 


gaatggctga 


atttcatctc 


11640 
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caccgaattg cataaaagct tcggagcgct gttcagcccg gtgtttccac aggatgccaa 11700 

gccagtcatc aaggcccaat tggaaagccg tcttgcccat accgagcaga tgctcggtga 11760 

caaggtttgg gccatgggga acgacttttc cgtggtggac gcctacttgt ttactgtgct 11820 

tggctggggg gcttacgtga atgtggacct ttcgccttgg cccggcctgc aaggctacct 11880 

5caaccgcgtg gctgaacgtc cagcagtccg ggccacattg tcggctgaag gtttgatctg 11940 

atttttttag agaaataaat aaagcatggc cagcctccat gagcggccag ccatgccgcg 12000 

gggcagccgt gggcagcggc ctcgatcaaa tttcgatgca cacagattac gagccccttt 12060 

ttccgactgc ggtcggattc agaaattgcc gatacttgct ttggtctgat cgaagcgtga 12120 

gggatatcgg cttcccctat ccacatgctt gatgaagagg agcgtatcac catgagtcac 12180 

lOgaacttggcc gactggaaga cctgccgcag gactaccgag acgaactcaa acaacttaac 12240 

ctggtgccgc tatggcccag cctacgcgcg gtgctgccgc ccaatgtccc gacccgccag 123 00 

acgcagccga cttactggtc ctaccagacg ctcaagccgc tgctgctcaa ggccggtgag 12360 

ctgaccccca ttgagaaggc cgagcgccgc gtgctggtgc tggccaaccc cggccacggc 12420 

ctggagaaga tgcaagccag cgccgccata tacctgggca tgcagttgct gctgcccggc 12480 

15gagtgggcgc ccagccaccg ccacaccccc aacgcggtgc gcatgatcgt ggagggcgag 12540 

ggcgcctaca ccaccgtgga tggcgagaag tgccccatga gccggggtga cctcatcctc 12600 

acgcccaccg gcctgtggca cgagcacggc cacgacggca acgagcccgt ggtgtggctc 12660 

gatgtgctgg acctgccgct ggtgtactac atggaggcca gctaccacat cgacggcgag 12720 

cgccagcagg tcgaccccgg ccggggcgac tgcgcctgga cccgtgcggg cgtggtgcca 12780 

20acccccgtgt tccagcgcag cgacaagcgc tatcctctct tgcgctaccc ttgggccgac 12 84 0 

acccgtgccg ccctgctgtc gctggcggcc gaccagcctg agcaggaatg cgtgcaggtc 12900 

acctacgtca accccgaaac gggtgacgac gccgagaaca tcctgggctt ctacgccctg 12960 

atgctcaagc ccggccagac cctgcgcctg cccgtgcgtt cgcccgccgt ggtgttccac 13020 

cagatcgaag gccgcagcga ggcgcgcatc gccgagtcca ccttcgccct gagggaagcc 13080 

25gatacctgct gcgcccccgg ctacaccgag gtgacgctga aaaacctctc agccgaccag 13140 

ccgtccttca tcttcatggc cgacgaatcg cccctgcacc gcaagctggg cgtctttgaa 13200 

aaccgcggct gagccgccgc gaagcaccac acaacaggaa agcaagtgag caactacctc 13260 

tggaacccgc ctcccgtcca gtctctgccc gttcgtggca agaccgagcg cttccccatc 13320 

aaccgcatct tctgcgtcgg ccgcaactac cacgcccacg cggtggaaat gggccgtccg 133 80 

30gtcgacaaaa gcgtcgaaca agcgttctac ttcaccaaat cgccacaaac cctggtggaa 13 4 40 

agtggtgcga ccgtggccta cccgccgcgc accagcaact accactacga gatggagctg 13 500 

gtgctggcga tcggcaagcc cggcttccgc gtcagcgaag accaggcgca cgagttgatt 13560 

tacggctacg ccgccggcct ggacatgacc cggcgcgacc tgcaactggt ggcacgcgac 13620 

aagggccgcc cctgggacac cggcaaggac atcgaggaag gctcggtctg ctccgagatc 13680 

35gtgccgatgc aaggcgtggt ggtggagcag ggcgcgatcg ccctggaagt caacggccag 13740 

accaaacagt cgtccaacgt ggacaagctg atctggaacg tccgcgagat cattgccgac 13800 

ctctccacct actaccactt gcaacccggc gacctgattt acaccggcac gcccgaaggc 13 860 

gtgggcgccg tggtggcggg tgacaagatc atcggccgtg tggaaggcat tgccgagatc 13920 

agcctgaccg tcggcccggc cgagtgagcc tgcacgcgat gaagctgtac aacttctggc 13980 

40gcagcggcac gtcacaccgg ctgcgcatcg cgctcaacct caagggcgtg ccctacgagt 14040 
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acctggccgt gcacctgggc aaggaagagc acctaaagga cgccttcaag gcgctgaacc 14100 

cacagcagtt ggtgcccgcg ctggacacgg gcgcgcaagt gctgatccag tcgccggcca 14160 

tcatcgaatg gctggaggaa cagtacccca cgcccgcgct gctgccggcc gacgccgacg 14220 

gccgccaacg ggtgcgcgcg ctggccgcca tcgtgggctg cgacatccac cccatcaaca 14280 

5accgccgcat tctggagtac ctgcgcaaga cgttcggcgc cgacgaggcc gccatcaacg 14340 

cctggtgtgg cacctggatc agcgccggtt tcgatgccta cgaagcactg ctggcggtcg 14400 

acccgaagcg cggccgctac agctttggcg acacacccac gctggccgac tgctacctgg 14460 

tgccgcaggt ggaaagcgcc cgccgcttcc aggtggacct gacgccctac cccctgatcc 14520 

gcgcagtgga cgcggcctgc ggcgagttgg acgcattfccg gcgcgccgcg ccagctgctc 14580 

lOaacctgattc ggcttgacga aaaatacgcc gccttcgccc taataaacag ttcgtagccc 14 640 

tgcccagtga caagtatcaa tgagcaatag tgaaaatgta aggggaaaga ttatgaaaca 14700 

tattctgacc cgacgagcag ccctgagcac tttgggctcc cttggaatcg ggggccttcc 14760 

gggcatgagc ttggcgcaat ccggccctgg cgtggccacg atcgtgattg gactcgccgc 14820 

^99cggagcg accgatatgg ctgcccgccg gttgtctgag ggaatgcgcg gtgcctatgc 14880 

15ttccagtgtg ctcgtcgaca accgcacggg ggcgggtggt cgaattgcga tccagcatgt 14 940 

taaggccgcg gcaccaaacg gcgcgacact fcctgctgacg cctgcatcca tgatgacgct 15000 

ataccctcac acgtacaaag atct 15024 



<210> 12 
<211> 4912 
<212> DNA 

<213> Artificial Sequence 

25 

<220> 

<223> A modified DNA sequence 
<400> 12 

30 gagctcgttg cgcaagcgct tccatcagtg ggggcatgaa gtacatctcg ccgatgtcgg 60 
tcattgccaa gttgaaggtg cgcgtgctgg caaatgggtc gaaagagtca cgggtcgtca 120 
gtgccgtctg cagcgtgttg agcgcataga ccacgggctc cgcaagatgc agtgcatacg 180 
gtgtcggctc catgcctttt gaggtgcgca agaacaaatc gtcctttagc gccgcacgca 240 
gccgtttaag tgaattgctg acggcaggct gcgtcagccc cagtttttcg ccggccgtcg 300 

35 atacgctccg gtcgagcagt agctggttga agaccaccag cagattcaag tcgatgtcgc 360 
gcagatccat gatgcctcac cattattcat gctggtgatt ttaactatca gacttgatct 420 
atagcgctat accgatcgac gcgccagtat cgcagccatt cggagacaac tgaaaaaaga 4 60 
gcttgcatgg aactggtagt agaacccctc aatttgcatc tgaacgcgga gaccggcagc 540 
accctgcttg acgtgctcag gtccaacgag gtccccattt cttatagctg catgtcgggc 600 

40 cgctgcggca cttgccgttg ccgtgtgatt gccggccatc ttcgcgataa cggctccgag 660 
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acagggcgcc cgcaggcagg aaagggggcc tatgtcctgg cctgtcaggc ggttctgacc 720 

gaagactgca cgatcgagat tcctgaatct gacgagatcg tggttcaccc ggcgcgcatc 780 

gtcaagggga cggtcacagc gatagacgaa gccacccatg acatccggcg cctgcgcatc 840 

aaactggcca aaccgcttga gttcagccct ggccagtacg caacggtgca gttcacgccc 900 

5 gaatgcgtcc gcccctattc gatggccggg ctgcctagcg atgcggaaat ggagtttcag 960 

attcgcgcgg ttccgggcgg gcatgtcagc aactacgttt tcaatgaact gtccgtaggc 1020 

gcttcggtgc ggatcagcgg ccccctcgga acggcctatc tgcggcgcac gcacaccggc 1080 

cccatgcttt gtgtgggggg tggaacaggt ctggcgcccg tcctttcgat cgttcgaggc 1140 

gcactggaaa gcgggatgag caaccccatc catctgtact tcggtgtgcg gagcgagcag 1200 

10 gacatctatg acgaggaacg ccttcacgca ttggctgcaa ggtttccgaa tctcaaggtg 1260 

aatgtcgttg ttgcaacagg ccctgccggc cctggtcatc gatccggcct ggtcaccgat 132 0 

ctgatcggcc gtgacttgcc caatttggcg ggatggcgcg cctacctgtg tggcgctccg 13 80 

gccatggtcg aggccctgaa cctgctcgtt gctcgcctag gcatagtacc cgggcacatc 1440 

catgccgatg cgttctatcc cagcggcgtc tgagcgaagg caccatgcga acccaattca 1500 

15 acccaaggat accaagccat gagtgaaccc caacgattaa aacccgtgtt tccccaagat 1560 

ccgaaatggc cgggcgaagg tagcagccgc gttcccttct gggcctacac ccgcgaagac 1620 

ctgtacaagc gcgaattgga gcgcctgttc tatgcaaacc actggtgcta tgtaggcctg 1680 

gaagccgaga ttccgaatcc aggcgacttc aagcgaacgg tgatcggtga gcgctcggtc 1740 

atcatggtgc gtgatccgga tggcggcatc aacgtggtgg agaacgtctg cgcccactgt 1800 

20 ggcatgcgct tttgccgcga gcgccacggc aacgccaagg acttcttctg cccctaccac 1860 

cagtggaact acagcctcaa gggtgacctg cagggcgtgc ccttccgccg aggcgtcaag 1920 

caggacggca aggtcaacgg cggcatgccc aaggacttca aactcgaaga acacggcctg 1980 

accaagctca aggtggccgc ccgaggcggt gcagtgtttg cctctttcga ccacgatgtc 2040 

gagcctttcg aggactgtgc gtgtacgaca gcgagatgat ccccaactcc ctcatctacc 2100 

25 ccatttgagg ttgcagaaca tgtccgagaa ctggattgat gccatcgcac gggacgctgt 2160 

gcctgagggc gatgtggtcg gagtcatcgt ggcaggcaaa gacattgcct tctatgaggt 2220 

ggaaggtgag gtcttcgcca ccgacaactt gtgtacccac ggggctgcgc gcttgagcga 2280 

cggctttctc gaaggccggg aaattgaatg tcctttgcat caaggccgat tcgatgtttg 2340 

cacgggtaaa gccttgtgca cacccctgac acaggacatc aaaacctacc ccgtaaaaat 24 00 

30 cgaaaacatg cgcgtgatgc tcaagctgga ctaaaactct ttgcaggagg aaagccaaat 2460 

ccggaaatca ccccacccaa cccaatcact acccgttttc aaacaagagg agataagcaa 2520 

ttatgagtta ccaaaactta gtgagtgaag cagggctgac gcaaaagctc ctgattcatg 2580 

gcgacaaaga acttttccag cacgaattga agaccatctt cgcgcggaac tggctttttc 2640 

tgacccatga cagtctgatt ccctcccccg gcgactatgt cacagccaaa atgggcgtcg 2700 

35 atgaagtcat cgtctcccgc cagaacgatg gctcggtgcg agcctttttg aatgtttgcc 2760 

gtcaccgggg caagacacta gttcacactg aagccggaaa tgcgaaaggc tttgtgtgcg 2 820 

gctaccacgg ctggggctac ggttccaacg gcgaactgca aagcgttccc tttgaaaaag 2880 

agttgtacgg agatgcgatc aaaaagaaat gcctgggctt gaaagaagtc ccccgcatcg 2 940 

aaagctttca tggctttatc tatggctgtt ttgatgcaga agctcccccg ctcatcgatt 3000 

40 atctgggtga tgcagcctgg tacctggaac ccaccttcaa gcactctggt ggcctggaac 3060 
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ttgtaggccc ccccggcaaa gtggtggtta aggccaactg gaagcctttt gcggaaaact 3120 

ttgtaggtga catctaccac gttggttgga cgcacgcagc ggctttgcgc gcagggcagt 3180 

cggtatttag ttctcttgcg ggcaacgcta agcttccacc cgaaggcgcg ggcttgcaaa 324 0 

tgaccagcaa gtatggcagt ggaatgggct taacgtggga ctactactcc ggtaacttca 33 00 

5 gcgctgatat ggttcccgat ctgatggcat tcggcgccgc aaaacaggaa aaactcgcca 3360 

aggaaatcgg cgatgtccgg gcacggattt accgcagcat tctgaacggc acggttttcc 3420 

cgaacaacag ctttttgacc ggctccgcta ccttcaaggt ctggaacccg atcgatgaaa 3480 

acacgaccga ggtttggacg tatgccttcg tagaaaaaga catgcctgag gacttaaagc 3540 

gtcgcttggc tgacgcggct cagcgcagtg tcggaccagc aggattctgg gaaagcgacg 3600 

10 acaacgaaaa catggagacg ttgtcgcaaa atgccaagaa ataccaatcc agcaacagtg 3660 

atcagattgc cagtttgggt ttcggcaagg acgtctacgg cgacgaatgc tatccgggcg 3720 

tcgttggcaa atcggcaatc ggcgaaacca gctatcgcgg attctaccgt gcctaccagg 3780 

ctcacatcag cagctccaat tgggccgagt tcgaaaatgc ctcccgaaat tggcacaccg 3840 

aactcaccaa gacaactgat cgctaatcca ggagccaacc atgatgatca atacccagga 3900 

15 agacaagctg gtctccgcgc acgacgccga agaatttcac cgtttcttca tcgtacaaga 3960 

tgatgcacta ctgcaagaag tcaacacgct cctgacccgc gaagcgcacc tgctggacat 4020 

tcaggcctac aaagcctggc ttgaacactg cgttgccccc gagatcaaat accaagtgat 4080 

ctcgcgagaa cttcgctcca cttccgagcg tcgataccaa ctgaatgatg cggtgaatct 4140 

ctacaacgag aactatcaac agctgaaagt tcgagttgaa caccagatgg atcctcagaa 4200 

20 ctgggccaac aacccgaaga tccgcttcac ccgcttcgtc accaatgtca cggcggccaa 4260 

ggacaagagc gcaccggaaa tgctgcatgt gcggtccaac ctcattctcc atcgcgccag 4320 

acgagaaaac caagttgacg tcttctatgc aacgcgtgaa gacaaatgga aacgcatcga 4 380 

aggtggtggt atcaaattgg tcgaacgctt tgtggactac ccggagcgca ttccccagac 4440 

ccacaacctg ttggtcttcc tgtgaaccct ggggatgcct gcctggatgg cgggcattcg 4500 

25 tgattatttt taacagaaat ttattgccat gaacacacag caagttgttg ccatcactgg 4560 

cgccggctcg ggcattggtt tcgagttggt tcgctctttt aaggcagccg gttatcgcgt 4620 

atctgcactc gttcgcaacg aggagcaaga ggcgggtctt cgcagtgaat tcaaagacga 4680 

cattgagatc gtggcgggcg atgtccgtga tcacgccacc aatgagaagc tggttaaaca 4740 

gacggttgcc aagttcgggc gcctggattg cttcatcgga aatgccggga tatgggatta 4800 

30 catgctgagc atcgatgatg cttgggagaa attctcgggc agtttcgacg agatatttga 4860 

catcaacgtc aaaagctatt tcagcggcat cagcgcggcc ttgccggagc tc 4912 

<210> 13 
<211> 6779 
35 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> A modified DNA sequence 

40 
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<400> 13 

atgcatacgt agcggtjggcg aaagaggcgc cgctggaaga atccggtctg tagctctggc 60 

agaagaccca aggcgagatc aaccgcaccg gactccatat cctccttcag attgccagca 120 

ttcgggcgca gcgtgctgat ctggatgtga ggagctcgtt gcgcaagcgc ttccatcagt 180 

^gggggcatga agtacatctc gccgatgtcg gtcattgcca agttgaaggt gcgcgtgctg 240 

gcaaatgggt cgaaagagtc acgggtcgtc agtgccgtct gcagcgtgtt gagcgcatag 300 

atcacgggct ccgcaagatg cagtgcatac ggtgtcggct ccatgccttt tgaggtgcgc 360 

aagaacaaat cgtcgttcag cgctgtacgc agccgtttaa gtgaattact gacggcaggc 420 

tgcgtcagcc ccagtttttc gccggccgtc gatacgctcc ggtcgagcag tagctggttg 480 

lOaagaccacca gcagattcaa gtcgatgtcg cgcagatcca tgacctcacc ctcaccatta 540 

ttcatgctgg tgattttaac tatcagactt gatctatagc gctataccga tcgacgcgct 600 

agaatcgcag ccattcggag acaactgaaa aaagagcttg catggaactg gtagtagaac 660 

ccctcaattt gcatctgaac gcggagaccg gcagcaccct gcttgacgtg ctcaggtcca 720 

acgaggtccc catttcttat agctgcatgt cgggccgctg cggcacttgc cgttgccgtg 780 

IStgattgccgg ccatcttcgc gataacggcc ccgagacagg gcgcccgcag gcaggaaagg 840 

gggcctatgt cctggcctgt caggcggttc tgaccgaaga ctgcacgatc gagattcctg 900 

aatctgacga gatcgtggtt cacccggcgc gcatcgtcaa ggggacggtc acagcgatag 960 

acgaagccac ccatgacatc cggcgcctgc gcatcaaact ggccaaaccg cttgagttca 1020 

gccctggcca gtacgcaacg gtgcagttca cgcccgaatg cgtccgcccc tattcgatgg 1080 

20ccgggctgcc tagcgatgcg gaaatggagt ttcagattcg cgcggttccg ggcgggcatg 1140 

tcagcaacta cgttttcaat gaactgtccg taggcgcttc ggtgcggatc agcggccccc 1200 

tcggaacggc ctatctgcgg cgcacgcaca ccggccccat gctttgtgtg gggggtggaa 1260 

oaggtctggc gcccgtcctt tcgatcgttc gaggcgcact ggaaagcggg atgagctacc 1320 

ccatccatct gtacttcggt gtgcggagcg agcaggacat ctatgacgag gaacgccttc 1380 

25acgcattggc tgcaaggttt ccgaatctca aggtgaatgt cgttgttgca acaggccctg 1440 

ccggccctgg tcatcgatcc ggcctggtca ccgatctgat cggccgtgac ttgcccaatt 1500 

tggcgggatg gcgactgcat cctgtgtggc gctccggcca tggtcgaggc cctgaacctg 1560 

ctcgttgctc gcctaggcat agtacccggg cacatccatg ccgatgcgtt ctatcccagc 162 0 

ggcgtctgag cgaaggcacc atgcgaaccc aattcaaccc aaggatacca agccatgagt 1680 

30gaaccccaac gattaaaacc cgtgtttccc caagatccga aatggccggg cgaaggtagc 1740 

agccgcgttc ccttctgggc ctacacccgc gaagacctgt acaagcgcga attggagcgc 1800 

ctgttctatg caaaccactg gtgctatgta ggcctggaag ccgagattcc gaatccaggc 1860 

gacttcaagc gaacggtgat cggtgagcgc tcggtcatca tggtgcgtga tccggatggc 1920 

ggcafccaacg tggtggagaa cgtctgcgcc caccgtggca tgcgcttttg ccgcgagcgc 1980 

35cacggcaacg ccaaggactt cttctgcccc taccaccagt ggaactacag cctcaagggt 2040 

gacctgcagg gcgtgccctt ccgccgtggc gtcaagcagg acggcaaggt caacggcggc 2100 

atgcccaagg acttcaaact cgaagaacac ggcctgacca agctcaaggt ggccgccccg 2160 

aggcggtgca gtgtttgcct ctttcgacca cgatgtcgag cctttcgagg acttcctggg 2220 

ccaaccatcc tgcactactt cgatcgcgtc ttcaatggcc gcaagctcaa gatcctgggc 2280 

40taccgccgcc agcgcatccc gggcaactgg aagctgatgc aggagaacat caaggacccc 2340 
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taccacccgg 


gcctgctgca 


cacctggttc 


tcgaccttcg 


ggctctggcg 


cgccgacaac 


2400 


aagtcggaac 


tgaagatgga 


cgccaagttc 


cgccacgccg 


caatgatctc 


cacgcgcggt 


2460 


cagggcggca 


agaacgagga 


ggtcgtgtcc 


ggcgtggaca 


gcttcaagga 


acagatgaag 


2520 


gtgaacgacc 


cgcgcctgct 


cgacatcgtg 


cccgagccct 


ggtggggcgg 


tccgactgcg 


2580 


Sgtgatgacca 


cgatcttccc 


cagcgtgatc 


atccagcagc 


aggtcaacag 


cgtatcgacc 


2640 


cgccacatcc 


agcccaacgg 


tcacggctcc 


ttcgatttcg 


tctggaccca 


cttcggcttc 


2700 


gaggacgaca 


acgaggagtg 


gacccagcgc 


cgcctgatcc 


aggccaacct 


gttcgggccg 


2760 


gcgggcttcg 


tgtcggccga 


tgacggcgag 


gtgatcgagt 


ggtcgcagga 


aggctttgag 


2820 


caaaaaccga 


cgcaccgcac 


cgtgatcgag 


^tgggcggtc 


acgaaatcgg 


cgacacggac 


2880 


lOcacatggtca 


ccgagacgct 


gatccgcggc 


atgtacgact 


actggcgcaa 


ggtgatgggg 


2940 


gaataaacat 


ggtagacttc 


aaaacctatt 


tcgaactgct 


gaacctgtac 


agcgactacg 


3000 


ccatggtgtg 


cgactccgcc 


aattgggaga 


agtggcctga 


tttcttcatc 


gagaccggca 


3060 


cctaccgcct 


gcaaccgcgc 


gaaaacttcg 


agcaggactt 


gccgctgtgt 


ctgctggcgc 


3120 


tggagagcaa 


ggccatgatt 


cgtgaccgag 


tgtacggtgt 


caaggaaacc 


atgtaccacg 


3180 


15atccctacta 


ccagcgccac 


atcgtaggca 


cgccgcgcgt 


gctgtcagtg 


gagcgtgatg 


3240 


cggacggcga 


gcgcatcacc 


gccgaagcca 


gctatgccgt 


gattcgcacc 


aagtacgacg 


3300 


gcgattccac 


gattttcaac 


gccggctatt 


accgagacgt 


gatcgtgcgc 


acgcccgagg 


3360 


gcctcaagct 


gaagtcgcgc 


ctgtgcgtgt 


acgacagcga 


aatgattccc 


aactccatca 


3420 


tctaccctat 


ctgagaagga 


atccaatgag 


cgagaactgg 


atcgacgccg 


ccgcccgcga 


3480 


20cgaggtgcca 


cgagggcgac 


gtgatcggca 


tcaatatcgt 


cggcaaggag 


attgcctcta 


3540 


C9^99tggcg 


ggcgagatct 


acgccaccga 


caacacctgc 


actcacggcg 


ccgcccgcat 


3600 


gagcgatggc 


tttctcgaag 


gccgggaaat 


tgaatgtcct 


ttgcatcaag 


gccgattcga 


3660 


tgtttgcacg 


ggtaaagcct 


tgtgcacacc 


cctgacacag 


gacatcaaaa 


cctaccccgt 


3720 


aaaaatcgaa 


aacatgcgcg 


tgatgctcaa 


gctggactaa 


atgctcaagc 


tggactaaaa 


3780 


25ctctttgcag 


gaggaaagcc 


aaatccggaa 


atcaccccac 


ccaacccaat 


cactacccgt 


3840 


tttcaaacaa 


gatgagacaa 


gcaattatga 


gttaccaaaa 


cttagtgagt 


gaagcagggc 


3900 


tgacgcaaaa 


gcacctgatt 


tatggcgaca 


aagaactttt 


ccagcacgaa 


ttgaagacca 


3960 


tcttcgcgcg 


gaactggctt 


tttctgaccc 


atgacagtct 


gattcccCcc 


cccggcgact 


4020 


atgtcaaagc 


caaaatgggc 


gtcgatgaag 


tcatcgtctc 


ccgccagaac 


gatggctcgg 


4080 


30tgcgagcctt 


tttgaatgtt 


tgccgtcacc 


ggggcaagac 


aatagttgac 


gctgaagccg 


4140 


gaaatgcgaa 


aggctttgtg 


tgcggttacc 


acggctgggg 


ctatggctcc 


aacggcgaac 


4200 


tgcaaagcgt 


tccctttgaa 


aaagagttgt 


acggagatgc 


gatcaaaaag 


aaatgcctgg 


4260 


gcttgaaaga 


agtcccccgc 


atcgaaagct 


ttcatggctt 


tatctatggc 


tgttttgatg 


4320 


cagaagctcc 


cccgctcatc 


gattatctgg 


gtgatgtagc 


ctggtacctg 


gaacccacct 


4380 


35tcaagcactc 


tggtggcctg 


gaacttgtag 


gcccccccgc 


caaagtggtg 


gttaagggca 


4440 


actggaaggt 


ttttgcggaa 


aactttgtag 


gtgacatcta 


ccacattggt 


tggacgcacg 


4500 


catctatttt 


gcgcgcaggg 


caggcgatat 


ttgctcctct 


tgcgggcaac 


gctatgcttc 


4560 


cacccgaagg 


cacgggcttg 


caagcgacca 


ccaagtatgg 


cagtggaatt 


ggcgtatcgt 


4620 


tggacgccta 


ctccggtgtc 


cagagcgctg 


atctggtCcc 


cgaaatgatg 


gcattcggcg 


4680 


40gcgcaaaaca 


ggaaaagctc 


gccaaagaaa 


tcggcgatgt 


ccgggcgcgg 


atttaccgca 


4740 



42 



gccaagtgaa cggcacggtt ttcccgaaca actgctttfct gaccggcgcc ggtgtcttca 4800 

aggtctttaa cccgabcgat gaaaacacga ccgaggcttg gacgtatgcc atcgtagaaa 4 860 

aagacatgcc tgaggactta aagcgtcgct tggctgacgc ggctcagcgc tctgtcggac 4920 

cagcaggata ctgggaaagc gacgacaacg acaacatggt gttgtcgcaa aatgccaaga 4 980 

Saataccaatc cagcaacagt gatctgattg ccgatttggg tttcggcaag gacgtctacg 504 0 

gcgacgaatg ctatccgggc gtcgttagca aatcggcatfc cagcgaaacc aaccatcgcg 5100 

gattctaccg tgcctaccag gctcacatca gcagctccaa ttgggccgag ttcgaaaaca 5160 

cctcccgaaa ttggcacacc gaactcacca agacgactga tcgctaatcc aggagccaat 5220 

catgatgatc aatacccagg aagacaagct ggtctccgcg cacgacgccg aagaatttca 5280 

lOccgtttcttc gtcgggcacg acagcgatct gcagcaagaa gtcaccacac tcctgacccg 5340 

cgaagccgac ctgctggaca ttcaggccta caaagcctgg cttgaacact gcgttgcccc 5400 

cgagatcaaa taccaagtga tctcgcgaga acttcgctcc acttccgagc gtcgatacca 5460 

actgaatgat gcggtgaata tctacaacga gaactatcaa cagctgaaag ttcgagttga 5520 

acaccagatg gatcctcaga actggtacaa cagcccgaag atccgcttca cccgcttcgt 558 0 

IScaccaatgtc acggcggcca aggacaagag cgcaccggaa atgctgcatg tgcggtccaa 5640 

cctcattctc catcgcgcca gacgaggaaa ccaagttgac gtcttctatg caacgcgaCga 5700 

agacaaatgg aaacgcatcg aaggtggtgg catcaaattg gtcgaacgct ttgtggacta 5760 

cccggagcgc agtccccaaa cccacaacct gatgatcttc ctgtgagccc tggggatgcc 5820 

tgcctggatg gcgggcattc gtgattattt ttaacagggc ggattcaaaa gtgaagtgca 5880 

20acaccctgga tttcagtgaa tgagagtgga gtgctgcggg ctattcacaa gcagttcgcg 5940 

gtagaccgcc agcggtgatc gtactcccag ccctttcctt ggccggttgt tgatttcatc 6000 

ggcaatcgca tcgagctgct cctggctgta gatgctcaga tccgtccctt tgggcaggta 6060 

ctggcgcacc aggccattca tgttctcgtt ggagcctctt tgccaagggc tgtgcgggtc 6120 

gcagaagtac acggctatgc cggtttgctc actgagcttc ttgtgcatgg acatctcccg 6180 

25gccctggtcg tacgtcatgc tcaaccgcat cggctgcgca atgcccagca gcttgtccgt 6240 

gaaggcctgc aagacgttgg cagcactggc cggcttgaac tcaggcagct tgaccagcat 6300 

caccagccgg ctggtgcgtt ccaccaaggt cccgactgcg ctggcattgc cttctccctt 6360 

gatgagatca ccttcccagt gcccgggaaa ctggcggtct tcgatctcgg gcgggcgtac 642 0 

atggatgctc agcatgtcgg gaatctggcc tctgcggtcc ttgcccttgc tgcgcggcag 6480 

30ccgcttgttg tgggcgtgac gcagggtggc gatcagctct ttgcgcagct ctcccacggg 6540 

catggcatag atgcagttgt agatggtttc gtgtgacacg cggtattcat ggcccagggg 6600 

atacaaacgt gccagtgtca gggcaatctg ctcgggcgac cagcgcagcc gcagcagatg 6660 

gatcaccagg caggacaaga tcgactcggg atgcagcttg atagctggtc tgccacagcg 6720 

tctgcgctgc agggactggc actgggcatg tgcgctgcca tagccagatg aagatgcat 6779 

35 

<210> 14 
<211> 449 
40 <212> PRT 
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<213> Artificial Sequence 
<220> 

<223> A polypeptide encoded by SEQ ID NO: 3 

5 

<400> 14 

Met Asn Tyr Asn Asn Lys He Leu Val Ser Glu Ser Gly Leu Ser Gin 

1 5 10 15 

Lys His Leu lie His Gly Asp Glu Glu Leu Phe Gin His Glu Leu Lys 
10 20 25 30 

Thr He Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu He 

35 40 45 

Pro Ala Pro Gly Asp Tyr Val Thr Ala Lys Met Gly He Asp Glu Val 
50 55 50 

15 He Val Ser Arg Gin Asn Asp Gly Ser He Arg Ala Phe Leu Asn Val 
65 70 75 80 

Cys Arg His Arg Gly Lys Thr Leu Val Ser Val Glu Ala Gly Asn Ala 

85 90 95 

Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 
20 100 105 110 

Glu Leu Gin Ser Val Pro Phe Glu Lys Asp Leu Tyr Gly Glu Ser Leu 

115 120 125 

Asn Lys Lys Cys Leu Gly Leu Lys Glu Val Ala Arg Val Glu Ser Phe 
130 135 140 

25 His Gly Phe He Tyr Gly Cys Phe Asp Gin Glu Ala Pro Pro Leu Met 
X45 150 155 160 

Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro Met Phe Lys His 

165 170 175 

Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val He Lys 
30 180 185 190 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His 

195 200 205 

Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Glu Ser He Phe 
210 215 220 

35 Ser Ser Leu Ala Gly Asn Ala Ala Leu Pro Pro Glu Gly Ala Gly Leu 
225 230 235 240 

Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 

245 250 255 

Tyr Ser Gly Val His Ser Ala Asp Leu Val Pro Glu Leu Met Ala Phe 
40 260 265 270 



Gly Gly Ala Lys Gin 
275 

Ala Arg lie Tyr Arg 
290 

5 Ser Met Leu Thr Cys 
305 

Ala Asn Thr Thr Glu 
325 

Pro Glu Asp Leu Lys 
10 340 
Gly Pro Ala Gly Phe 
355 

Ala Ser Gin Asn Gly 
370 

15 Ser Asn Leu Gly Phe 
385 

Gly Val Val Gly Lys 
405 

Tyr Arg Ala Tyr Gin 
20 420 

Glu His Ala Ser Ser 
435 

Arg 



44 

Glu Arg Leu Asn Lys 
280 

Ser His Leu Asn Cys 
295 

Ser Gly Val Phe Lys 
310 

Val Trp Thr Tyr Ala 
330 

Arg Arg Leu Ala Asp 
345 

Trp Glu Ser Asp Asp 
360 

Lys Lys Tyr Gin Ser 
375 

Gly Glu Asp Val Tyr 
390 

Ser Ala He Gly Glu 
410 

Ala His Val Ser Ser 
425 

Thr Trp His Thr Glu 
440 



Glu He Gly Asp Val Arg 

285 

Thr Val Phe Pro Asn Asn 
300 

Val Trp Asn Pro He Asp 
315 320 
He Val Glu Lys Asp Met 
335 

Ser Val Gin Arg Thr Val 
350 

Asn Asp Asn Met Glu Thr 
365 

Arg Asp Ser Asp Leu Leu 
380 

Gly Asp Ala Val Tyr Pro 
395 400 
Thr Ser Tyr Arg Gly Phe 
415 

Ser Asn Trp Ala Glu Phe 
430 

Leu Thr Lys Thr Thr Asp 
445 



<210> 15 
<211> 449 
<2X2> PRT 

<213> Artificial Sequence 
<220> 

<223> A polypeptide encoded by SEQ ID NO: 4 



<400> 15 

35 Met Asn Tyr Asn Asn Lys He Leu 
1 5 
Lys His Leu He His Gly Asp Glu 
20 

Thr He Phe Ala Arg Asn Trp Leu 
40 35 40 



Val Ser Glu Ser Gly Leu Ser Gin 

10 15 
Glu Leu Phe Gin His Glu Leu Lys 
25 30 
Phe Leu Thr His Asp Ser Leu He 
45 



45 

Pro Ala Pro Gly Asp Tyr Val Thr Ala Lys Met Gly lie Asp Glu Val 

50 \ 55 60 

He Val Ser Arg Gin Asn Asp Gly Ser He Arg Ala Phe Leu Asn Val 
65 70 75 80 

5 Cys Arg His Arg Gly Lys Thr Leu Val Ser Val Glu Ala Gly Asn Ala 
85 90 95 

Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 

100 ^ 105 110 

Glu Leu Gin Ser Val Pro Phe Glu Lys Asp Leu Tyr Gly Glu Ser Leu 
10 115 120 125 

Asn Lys Lys Cys Leu Gly Leu Lys Glu Val Ala Arg Val Glu Ser Phe 

130 135 140 

His Gly Phe He Tyr Gly Cys Phe Asp Gin Glu Ala Pro Pro Leu Met 
145 150 155 160 

15 Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro Met Phe Lys His 
165 170 175 

Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val He Lys 

180 185 190 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His 
20 195 200 205 

Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Glu Ser He Phe 

210 215 220 

Ser Ser Leu Ala Gly Asn Ala Ala Leu Pro Pro Glu Gly Ala Gly Leu 
225 230 235 240 

25 Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 
245 250 255 

Tyr Ser Gly Val His Ser Ala Asp Leu Val Pro Glu Leu Met Ala Phe 

260 265 270 

Gly Gly Ala Lys Gin Glu Arg Leu Asn Lys Glu He Gly Asp Val Arg 
30 275 280 285 

Ala Arg He Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn 

290 295 300 

Ser Met Leu Thr Cys Ser Gly Val Phe Lys Val Trp Asn Pro He Asp 
305 310 315 320 

35 Ala Asn Thr Thr Glu Val Trp Thr Tyr Ala He Val Glu Lys Asp Met 
325 330 335 

Pro Glu Asp Leu Lys Arg Arg Leu Ala Asp Ser Val Gin Arg Thr Val 
340 345 350 

Gly Pro Ala Gly Phe Trp Glu Ser Asp Asp Asn Asp Asn Met Glu Thr 
40 355 360 365 



Ala Ser Gin Asn Gly Lys Lys Tyr 

370 375 
Ser Asn Leu Gly Phe Gly Glu Asp 
385 390 
5 Gly Val Val Gly L»ys Ser Ala He 
405 

Tyr Arg Ala Tyr Gin Ala His Val 
420 

Glu His Ala Ser Ser Thr Trp His 
10 435 440 

Arg 



46 

Gin Ser Arg Asp Ser Asp Leu Leu 
380 

Val Tyr Gly Asp Ala Val Tyr Pro 
395 400 
Gly Glu Thr Ser Tyr Arg Gly Phe 

410 415 
Ser Ser Ser Asn Trp Ala Glu Phe 
425 430 
Thr Glu Leu Thr Lys Thr Thr Asp 
445 



<210> 16 
15 <211> 449 

<212> PRT 

<213> Artificial Sequence 
<220> 

20 <223> A polypeptide encoded by SEQ ID NO: 5 

<400> 16 

Met Asn Tyr Asn Asn Lys He Leu Val Ser Glu Ser Gly Leu Ser Gin 
15 10 15 

25 Lys His Leu He His Gly Asp Glu Glu Leu Phe Gin His Glu Leu Lys 
20 - 25 30 

Thr He Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu He 

35 40 45 

Pro Ala Pro Gly Asp Tyr Val Thr Ala Lys Met Gly He Asp Glu Val 
30 50 55 60 

He Val Ser Arg Gin Asn Asp Gly Ser He Arg Ala Phe Leu Asn Val 
65 70 75 80 

Cys Arg His Arg Gly Lys Thr Leu Val Ser Val Glu Ala Gly Asn Ala 
85 90 95 

35 Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 
100 105 110 

Glu Leu Gin Ser Val Pro Phe Glu Lys Asp Leu Tyr Gly Glu Ser Leu 

115 120 125 

Asn Lys Lys Cys Leu Gly Leu Lys Glu Val Ala Arg Val Glu Ser Phe 
40 130 135 140 



47 



His Gly Phe lie Tyr Gly Cys Phe Asp Gin Glu Ala Pro Pro Leu Met 
145 150 155 160 

Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro Met Phe Lys His 
165 170 175 

5 Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val lie Lys 
180 185 190 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His 

195 200 205 

Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Glu Ser He Phe 
10 210 215 220 

Cys Ser Leu Ala Gly Asn Ala Ala Leu Pro Pro Glu Gly Ala Gly Leu 
225 230 235 240 

Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 
245 250 255 

15 Tyr Ser Gly Val His Ser Ala Asp Leu Val Pro Glu Leu Met Ala Phe 
260 265 270 

Gly Gly Ala Lys Gin Glu Arg Leu Asn Lys Glu He Gly Asp Val Arg 

275 280 285 

Ala Arg He Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn 
20 290 295 300 

Ser Met Leu Thr Cys Ser Gly Val Phe Lys Val Trp Asn Pro He Asp 
305 310 315 320 

Ala Asn Thr Thr Glu Val Trp Thr Tyr Ala He Val Glu Lys Asp Met 
325 330 335 

25 Pro Glu Asp Leu Lys Arg Arg Leu Ala Asp Ser Val Gin Arg Thr Val 
340 345 350 

Gly Pro Ala Gly Phe Trp Glu Ser Asp Asp Asn Asp Asn Met Glu Thr 

355 360 365 

Ala Ser Gin Asn Gly Lys Lys Tyr Gin Ser Arg Asp Ser Asp Leu Leu 
30 370 375 380 

Ser Asn Leu Gly Phe Gly Glu Asp Val Tyr Gly Asp Ala Val Tyr Pro 
385 390 395 400 

Gly Val Val Gly Lys Ser Ala He Gly Glu Thr Ser Tyr Arg Gly Phe 
405 410 415 

35 Tyr Arg Ala Tyr Gin Ala His Val Ser Ser Ser Asn Trp Ala Glu Phe 
420 425 430 

Glu His Ala Ser Ser Thr Trp His Thr Glu Leu Thr Lys Thr Thr Asp 
435 440 445 

Arg 

40 



48 

<210> 17 
<211> 449 
<212> PRT 

<213> Artificial Sequence 

5 

<220> 

<22 3> A polypeptide encoded by SEQ ID NO: 6 
<;400> 17 

10 Met Asn Tyr Asn Asn Lys He Leu Val Ser Glu Phe Gly Leu Ser Gin 
15 10 15 

Lys His Leu He His Gly Asp Glu Glu Leu Phe Gin His Glu Leu Lys 

20 25 30 

Thr He Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu He 
15 35 40 45 

Pro Ala Pro Gly Asp Tyr Val Thr Ala Lys Met Gly He Asp Glu Val 

50 55 60 

He Val Ser Arg Gin Asn Asp Gly Ser He Arg Ala Phe Leu Asn Val 
65 70 75 80 

20 Cys Arg His Arg Gly Lys Thr Leu Val Ser Val Glu Ala Gly Asn Ala 
85 90 95 

Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 

100 105 110 

Glu Leu Gin Ser Val Pro Phe Glu Lys Asp Leu Tyr Gly Glu Ser Leu 
25 115 120 125 

Asn Lys Lys Cys Leu Gly Leu Lys Glu Val Ala Arg Val Glu Ser Phe 

130 135 140 

His Gly Phe He Tyr Gly Cys Phe Asp Gin Glu Ala Pro Pro Leu Met 
145 150 155 160 

30 Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro Met Phe Lys His 
165 170 175 

Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val He Lys 

180 185 190 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His 
35 195 200 205 

Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Glu Ser He Phe 

210 215 220 

Ser Ser Leu Ala Gly Asn Ala Ala Leu Pro Pro Glu Gly Ala Gly Leu 
225 230 235 240 

40 Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 



49 



245 250 255 

Tyr Ser Gly Val .^iis Ser Ala Asp Leu Val Pro Giu Leu Met Ala Phe 

260 265 270 

Gly Gly Ala Lys Gin Glu Arg Leu Asn Lys Glu He Gly Asp Val Arg 
5 275 280 285 

Ala Arg He Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn 

290 295 300 

Ser Met Leu Thr Cys Ser Gly Val Phe Lys Val Trp Asn Pro He Asp 
305 310 315 320 

10 Ala Asn Thr Thr Glu Val Trp Thr Tyr Ala He Val Glu Lys Asp Met 
325 330 335 

Pro Glu Asp Leu Lys Arg Arg Leu Ala Asp Ser Val Gin Arg Thr Val 

340 345 350 

Gly Pro Ala Gly Phe Trp Glu Ser Asp Asp Asn Asp Asn Met Glu Thr 
15 355 360 365 

Ala Ser Gin Asn Gly Lys Lys Tyr Gin Ser Arg Asp Ser Asp Leu Leu 

370 375 380 

Ser Asn Leu Gly Phe Gly Glu Asp Val Tyr Gly Asp Ala Val Tyr Pro 
385 390 395 400 

20 Gly Val Val Gly Lys Ser Ala He Gly Glu Thr Ser Tyr Arg Gly Phe 
405 410 415 

Tyr Arg Ala Tyr Arg Ala His Val Ser Ser Ser Asn Trp Ala Glu Phe 

420 425 430 

Glu His Ala Ser Ser Thr Trp His Thr Glu Leu Thr Lys Thr Thr Asp 
25 435 440 445 

Arg 



<210> 18 
30 <211> 449 

<212> PRT 

<213> Artificial Sequence 
<220> 

35 <223> A polypeptide encoded by SEQ ID NO: 7 

<400> 18 

Met Asn Tyr Lys Asn Lys He Leu Val Ser Glu Ser Gly Leu Thr Gin 
15 10 15 

40 Lys His Leu He His Gly Asp Glu Glu Leu Phe Gin His Glu Leu Arg 



50 



20 25 30 

Thr He Phe Ala ^Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu He 

35 40 45 

Pro Ser Pro Gly Asp Tyr Val Thr Ala Lys Met Gly He Asp Glu Val 
5 50 55 60 

lie Val Ser Arg Gin Ser Asp Gly Ser He Arg Ala Phe Leu Asn Val 
65 70 75 80 

Cys Arg His Arg Gly Lys Thr Leu Val Asn Ala Glu Ala Gly Asn Ala 
85 90 95 

10 Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 
100 105 110 

Glu Leu Gin Ser Val Pro Phe Glu Lys Glu Leu Tyr Gly Glu Ser Leu 

115 120 125 

Asn Lys Lys Cys Leu Gly Leu Lys Glu Val Ala Arg Val Glu Ser Phe 
15 130 135 140 

His Gly Phe He Tyr Gly Cys Phe Asp Gin Glu Ala Pro Pro Leu Met 
145 150 155 160 

Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro He Phe Lys His 
165 170 175 

20 Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val He Lys 
180 185 190 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His 

195 200 205 

Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Glu Ser He Phe 
25 210 215 220 

Ala Ser Leu Ala GJ.y Asn Ala Val Leu Pro Pro Glu Gly Ala Gly Leu 
225 230 235 240 

Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 
245 250 255 

30 Tyr Ser Gly Val His Ser Ala Asp Leu Val Pro Glu Leu Met Ala Phe 
260 265 270 

Gly Gly Ser Lys Gin Glu Arg Leu Asn Lys Glu He Gly Asp Val Arg 

275 280 285 

Ala Arg He Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn 
35 290 295 300 

Ser Met Leu Thr Cys Ser Gly Val Phe Lys Val Trp Asn Pro He Asp 
305 310 315 320 

Ala Asn Thr Thr Glu Val Trp Thr Tyr Ala He Val Glu Lys Asp Met 
325 330 335 

40 Pro Glu Asp Leu Lys Arg Arg Leu Ala Asp Ser Val Gin Arg Thr Val 



65 



51 



145 150 155 160 

Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro Met Phe Lys His 

165 170 175 

Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val lie Lys 
5 180 185 190 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His 

195 200 205 

Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Glu Ser lie Phe 
210 215 220 

10 Cys Ser Leu Ala Gly Asn Ala Ala Leu Pro Pro Glu Gly Ala Gly Leu 
225 230 235 240 

Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 

245 250 255 

Tyr Ser Gly Val His Ser Ala Asp Leu Val Pro Glu Leu Met Ala Phe 
15 260 265 270 

Gly Gly Ala Lys Gin Glu Arg Leu Asn Lys Glu lie Gly Asp Val Arg 

275 280 285 

Ala Arg lie Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn 
290 295 300 

20 Ser Met Leu Thr Cys Ser Gly Val Phe Lys Val Trp Asn Pro lie Asp 
305 310 315 320 

Ala Asn Thr Thr Glu Val Trp Thr Tyr Ala lie Val Glu Lys Asp Met 

325 330 335 

Pro Glu Asp Leu Lys Arg Arg Leu Ala Asp Ser Val Gin Arg Thr Val 
25 340 345 350 

Gly Pro Ala Gly Phe Trp Glu Ser Asp Asp Asn Asp Asn Met Glu Thr 

355 360 365 

Ala Ser Gin Asn Gly Lys Lys Tyr Gin Ser Arg Asp Ser Asp Leu Leu 
370 375 380 

30 Ser Asn Leu Gly Phe Gly Glu Asp Val Tyr Gly Asp Ala Val Tyr Pro 
385 390 395 400 

Gly Val Val Gly Lys Ser Ala lie Gly Glu Thr Ser Tyr Arg Gly Phe 

405 410 415 

Tyr Arg Ala Tyr Gin Ala His Val Ser Ser Ser Asn Trp Ala Glu Phe 
35 420 425 430 

Glu His Ala Ser Ser Thr Trp His Thr Glu Leu Thr Lys Thr Thr Asp 
435 440 445 

Arg 



<210> 17 
<211> 449 
<212> PRT 



52 

85 90 95 

hys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 

100 105 110 

Glu Leu Gin Ser Val Pro Phe Glu Lys Glu Leu Tyr Gly Glu Ser Leu 
5 115 120 125 

Asn Lys Lys Cys Leu Gly Leu Lys Glu Val Ala Arg Val Glu Ser Phe 

130 135 140 

His Gly Phe lie Tyr Gly Cys Phe Asp Gin Glu Ala Pro Ser Leu Met 
145 150 155 160 

10 Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro He Phe Lys His 
165 170 175 

Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val lie Lys 

180 185 190 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His 
15 195 200 205 

Val Gly Trp Thr His Ala Ser Ser Leu Arg Thr Gly Glu Ser He Phe 

210 215 220 

Ser Ser Leu Ala Gly Asn Ala Val Leu Pro Pro Glu Gly Ala Gly Leu 
225 230 235 240 

20 Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 
245 250 255 

Tyr Ser Gly Val His Ser Ala Asp Leu Val Pro Glu Leu Met Ala Phe 

260 265 270 

Gly Gly Ala Lys Gin Glu Arg Leu Asn Lys Glu He Gly Asp Val Pro 
25 275 280 285 

Ala Arg He Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn 

290 295 300 

Ser Val Leu Thr Cys Ser Gly Val Phe Lys Val Trp Asn Pro He Asp 
305 310 315 320 

30 Ala Asn Thr Thr Glu Val Trp Thr Tyr Ala He Val Glu Lys Asp Met 
325 330 335 

Pro Glu Asp Leu Lys Arg Arg Leu Ala Asp Ala Val Gin Arg Thr Val 

340 345 350 

Gly Pro Ala Gly Phe Trp Glu Ser Asp Asp Asn Asp Asn Met Glu Thr 
35 355 360 365 

Ala Ser Gin Asn Gly Lys Lys Tyr Gin Ser Arg Asp Ser Asp Leu He 

370 375 380 

Ser Asn Leu Gly Phe Gly Lys Asp Val Tyr Gly Asp Ala Val Tyr Pro 
385 390 395 400 

40 Gly Val Val Gly Lys Ser Ala He Gly Glu Thr Ser Tyr Arg Gly Phe 



40B 

Tyr Arg Ala Tyr Gin Ala His Val 
420 

Glu Asp Ala Ser Ser Tiir Trp His 
5 435 440 

Arg 



53 

410 415 
Ser Ser Ser Asn Trp Ala Glu Phe 
425 430 
Thr Glu Leu Thr Lys Thr Thr Asp 
445 



<210> 20 
10 <211> 449 

<212> PRT 

<213> Artificial Sequence 
<220> 

15 <223> A polypeptide encoded by SEQ ID NO: 9 

<400> 20 

Met Asn Tyr Lys Asn Lys He Leu Val Ser Glu Ser Gly Leu Thr Gin 
15 10 15 

20 Lys His Leu lie His Gly Gly Glu Gly Leu Phe Gin His Glu Leu Arg 
20 25 30 

Ala Val Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu He 

35 40 45 

Pro Ser Pro Gly Asp Tyr Val Thr Ala Lys Met Gly He Asp Glu Val 
25 50 55 60 

He Val Ser Arg Gin Ser Asp Gly Ser He Arg Ala Phe Leu Asn Val 
65 70 75 80 

cys Arg His Arg Gly Lys Thr Leu Val Asn Ala Glu Ala Gly Asn Ala 
85 90 95 

30 Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 
100 105 110 

Glu Leu Gin Ser Val Pro Phe Glu Lys Glu Leu Tyr Gly Glu Ser Leu 

115 120 125 

Asn Lys Lys Cys Leu Gly Leu Lys Glu Val Ala Arg Val Glu Ser Phe 
35 130 135 140 

His Gly Phe He Tyr Ala Cys He Asp Gin Glu Ala Pro Ser Leu Met 
145 150 155 160 

Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro He Phe Lys His 
165 170 175 

40 Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val He Lys 



180 

Ala Asn Trp Lys Ala 
195 

Val Gly Trp Thr His 
5 210 

Ser Ser Leu Ala Gly 
225 

Gin Met Thr Ser Lys 
245 

10 Tyr Ser Gly Val His 
260 

Gly Gly Ala Lys Gin 
275 

Ala Arg lie Tyr Arg 
15 290 

Ser Val Leu Thr Cys 
305 

Ala Asn Thr Thr Glu 
325 

20 Pro Glu Asp Leu Lys 
340 

Gly Pro Ala Gly Phe 
355 

Ala Ser Gin Asn Gly 
25 370 

Ser Asn Leu Gly Phe 
385 

Gly Val Val Gly Lys 
405 

30 Tyr Arg Ala Tyr Gin 
420 

Glu Asp Ala Ser Ser 
435 

Arg 

35 



54 

185 

Pro Ala Glu Asn Phe 
200 

Ala Ser Ser Leu Cys 
215 

Asn Ala Val Leu Pro 
230 

Tyr Gly Ser Gly Met 
250 

Ser Ala Asp Leu Val 
265 

Glu Arg Leu Asn Lys 
280 

Ser His Leu Asn Cys 
295 

Ser Gly Val Phe Lys 
310 

Val Trp Thr Tyr Ala 
330 

Arg Arg Leu Ala Asp 
345 

Trp Glu Ser Asp Asp 
360 

Lys Lys Tyr Gin Ser 
375 

Gly Lys Asp Val Tyr 
390 

Ser Ala lie Gly Glu 
410 

Ala His Val Ser Ser 
425 

Thr Trp His Thr Glu 
440 



190 

Val Gly Asp Ala Tyr His 
205 

Thr Gly Glu Ser lie Phe 
220 

Pro Glu Gly Ala Gly Leu 
235 240 
Gly Val Leu Trp Asp Gly 
255 

Pro Glu Leu Met Ala Phe 
270 

Glu lie Gly Asp Val Arg 
285 

Thr Val Phe Pro Asn Asn 
300 

Val Trp Asn Pro lie Asp 
315 320 
He Val Glu Lys Asp Met 
335 

Ala Val Gin Arg Thr Val 
350 

Asn Asp Asn Met Glu Thr 
365 

Arg Asp Ser Asp Leu He 
380 

Gly Asp Ala Val Tyr Pro 
395 400 
Thr Ser Tyr Arg Gly Phe 
415 

Ser Asn Trp Ala Glu Phe 
430 

Leu Thr Lys Thr Thr Asp 
445 



<210> 21 
<211> 449 
<212> PRT 
40 <213> Artificial 



Sequence 



55 

<220> 

<223> A polypeptide encoded by SEQ ID NO: 10 



<400> 21 

5 Met Asn Tyr Lys Asn Lys Asn Leu Val Ser Glu Ser Gly Leu Thr Gin 
15 10 15 

Lys His Leu He His Gly Asp Glu Glu Leu Phe Gin Arg Glu Leu Glu 

20 25 30 

Thr He Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu He 
10 35 40 45 

Pro Ser Pro Gly Asp Tyr Val Thr Ala Lys Met Gly Val Asp Glu Val 

50 55 60 

He Val Ser Arg Gin Asn Asp Gly Ser He Arg Ala Phe Leu Asn Val 
65 70 75 80 

15 Cys Arg His Arg Gly Lys Thr Leu Val His Ala Glu Ala Gly Asn Ala 
85 90 95 

Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ala Asn Gly 

100 105 110 

Glu Leu Gin Ser Val Pro Phe Glu Lys Glu Leu Tyr Gly Glu Ala Leu 
20 115 120 125 

Asp Lys Lys Cys Met Gly Leu Lys Glu Val Ala Arg Val Glu Ser Phe 

130 135 140 

His Gly Phe He Tyr Gly Cys Phe Asp Glu Glu Ala Pro Ser Leu Lys 
145 150 155 160 

25 Asp Tyr Met Gly Asp Ala Gly Trp Tyr Leu Glu Pro Met Phe Lys His 
165 170 175 

Ser Gly Gly Leu Glu Leu He Gly Pro Pro Gly Lys Val He He Lys 

180 185 130 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Thr Gly Asp Ala Tyr His 
30 195 200 205 

Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Gin Ser Val Phe 

210 215 220 

Ser Ser Leu Ala Gly Asn Ala Ala Leu Pro Pro Glu Gly Ala Gly Leu 
225 230 235 240 

35 Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 
245 250 255 

Tyr Ser Gly Val His Ser Ala Asp Leu Val Pro Glu Leu Met Ala Phe 

260 265 270 

Gly Gly Ala Lys Gin Glu Arg Leu Asn Lys Glu He Gly Glu Val Arg 
40 275 280 285 



56 



Ala Arg lie Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn 

290 295 300 

Ser Phe Leu Thr Cys Ser Gly Val Phe Lys Val Trp His Pro lie Asp 
305 310 315 320 

5 Ala Asn Thr Thr Glu Val Trp Thr Tyr Ala Met Val Glu Lys Asp Met 
325 330 335 

Pro Glu Asp Leu Lys Arg Arg Leu Val Asp Ala Val Gin Arg Thr Val 

340 345 350 

Gly Pro Ala Gly Phe Trp Glu Ser Asp Asp Asn Asp Asn Met Glu Thr 
10 355 360 365 

Val Ser Gin Asn Ala Lys Lys Tyr Gin Ser Arg Asp Gly Asp Leu Val 

370 375 380 

Ser Asn Leu Gly Phe Gly Gly Asp Val Tyr Gly Asp Glu Val Tyr Pro 
385 390 395 400 

15 Gly He Val Gly Lys Ser Ala He Gly Glu Thr Ser Tyr Arg Gly Phe 
405 410 415 

Tyr Arg Ala Tyr Gly Ala His He Ser Ser Ser Ser Trp Ala Glu Phe 

420 425 430 

Glu Asp Val Ser Lys Asn Trp His Thr Glu Leu Ala Lys Thr Thr Asp 



20 



435 



440 



445 



Arg 



<210> 22 
25 <211> 447 

<212> PRT 

<213> Artificial Sequence 
<220> 

30 <223> A polypeptide encoded by SEQ ID NO: 11 

<400> 22 

Met He Tyr Glu Asn Leu Val Ser Glu Ala Gly Leu Thr Gin Lys His 
15 10 15 

35 Leu He His Gly Asp Lys Glu Leu Phe Gin His Glu Leu Lys Thr He 
20 25 30 

Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu He Pro Ser 

35 40 45 

Pro Gly Asp Tyr Val Thr Ala Lys Met Gly Val Asp Glu Val He Val 
40 50 55 60 



57 

Ser Arg Gin Asn Asp Gly Ser Val Arg Ala Phe Leu Asn Val Cys Arg 
65 70 75 80 

His Arg Gly Lys Thr Leu Val His Ala Glu Ala Gly Asn Ala Lys Gly 
85 90 95 

5 Phe val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly Glu Leu 
100 105 110 

Gin Ser Val Pro Phe Glu Lys Glu Leu Tyr Gly Asp Thr lie Lys Lys 

115 120 125 

Lys Cys Leu Gly Leu Lys Glu Val Pro Arg lie Glu Ser Phe His Gly 
10 130 135 140 

Phe lie Tyr Gly Cys Phe Asp Ala Glu Ala Pro Thr Leu Val Asp Tyr 
145 150 155 160 

Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro lie Phe Lys His Ser Gly 
165 170 175 

15 Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val lie Lys Ala Asn 
180 185 190 

Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His Val Gly 

155 200 205 

Trp Thr His Ala Ser Ser Leu Arg Ser Gly Gin Ser lie Phe Thr Pro 
20 210 215 220 

Leu Ala Gly Asn Ala Met Leu Pro Pro Glu Gly Ala Gly Leu Gin Met 
225 230 235 240 

Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly Tyr Ser 
245 250 255 

25 Gly Val His Ser Ala Asp Leu Val Pro Glu Met Met Ala Phe Gly Gly 
260 265 _ 270 

Ala Lys Gin Glu Lys Leu Ala Lys Glu He Gly Asp Val Arg Ala Arg 

275 280 285 

He Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn Ser He 
30 290 295 300 

Leu Thr Cys Ser Gly Val Phe Lys Val Trp Asn Pro He Asp Glu Asn 
305 310 315 320 

Thr Thr Glu Val Trp Thr Tyr Ala He Val Glu Lys Asp Met Pro Glu 
325 330 335 

35 Asp Leu Lys Arg Arg Leu Ala Asp Ala Val Gin Arg Thr Val Gly Pro 
340 345 350 

Ala Gly Phe Trp Glu Ser Asp Asp Asn Asp Asn Met Glu Thr Glu Ser 

355 360 365 

Gin Asn Ala Lys Lys Tyr Gin Ser Ser Asn Ser Asp Leu He Ala Asn 
40 370 375 380 



58 

Leu Gly Phe Gly Lys Asp Val Tyr Gly Asp Glu Cys Tyr Pro Gly Val 
385 390 395 400 

Val Ala Lys Ser Ala lie Gly Glu Thr Ser Tyr Arg Gly Phe Tyr Arg 
405 410 415 

5 Ala Tyr Gin Ala His lie Ser Ser Ser Asn Trp Ala Glu Phe Glu Asn 
420 425 430 

Thr Ser Arg Asn Trp His Thr Glu Leu Thr Lys Thr Thr Asp Arg 
435 440 445 

10 <210> 23 

<211> 447 
<212> PRT 

<213> Artificial Sequence 

15 <220> 

<223> A polypeptide encoded by SEQ ID NO: 12 

<400> 23 

Met Ser Tyr Gin Asn Leu Val Ser Glu Ala Gly Leu Thr Gin Lys Leu 
20 1 5 10 15 

Leu He His Gly Asp Lys Glu Leu Phe Gin His Glu Leu Lys Thr He 

20 25 30 

Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu He Pro Ser 
35 40 45 

25 Pro Gly Asp Tyr Val Thr Ala Lys Met Gly Val Asp Glu Val He Val 
50 55 60 

Ser Arg Gin Asn Asp Gly Ser Val Arg Ala Phe Leu Asn Val Cys Arg 
65 70 75 80 

His Arg Gly Lys Thr Leu Val His Thr Glu Ala Gly Asn Ala Lys Gly 
30 85 90 95 

Phe Val Cys Gly Tyr His Gly Trp Gly Tyr Gly Ser Asn Gly Glu Leu 

100 105 110 

Gin Ser Val Pro Phe Glu Lys Glu Leu Tyr Gly Asp Ala He Lys Lys 
115 120 125 

35 Lys Cys Leu Gly Leu Lys Glu Val Pro Arg He Glu Ser Phe His Gly 
130 135 140 

Phe He Tyr Gly Cys Phe Asp Ala Glu Ala Pro Pro Leu He Asp Tyr 
145 150 155 160 

Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro Thr Phe Lys His Ser Gly 
40 165 170 175 



59 



Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val Val Lys Ala Asn 

180 185 190 

T3rp Lys Pro Phe Ala Glu Asn Phe Val Gly Asp lie Tyr His Val Gly 
195 200 205 

5 Trp Thr His Ala Ala Ala Leu Arg Ala Gly Gin Ser Val Phe Ser Ser 
210 215 220 

Leu Ala Gly Asn Ala Lys Leu Pro Pro Glu Gly Ala Gly Leu Gin Met 
225 230 235 240 

Thr Ser Lys Tyr Gly Ser Gly Met Gly Leu Thr Trp Asp Tyr Tyr Ser 
10 245 250 255 

Gly Asn Phe Ser Ala Asp Met Val Pro Asp Leu Met Ala Phe Gly Ala 

260 265 270 

Ala Lys Gin Glu Lys Leu Ala Lys Glu He Gly Asp Val Arg Ala Arg 
275 280 285 

15 He Tyr Arg Ser He Leu Asn Gly Thr Val Phe Pro Asn Asn Ser Phe 
290 295 300 

Leu Thr Gly Ser Ala Thr Phe Lys Val Trp Asn Pro He Asp Glu Asn 
305 310 315 320 

Thr Thr Glu Val Trp Thr Tyr Ala Phe Val Glu Lys Asp Met Pro Glu 
20 325 330 335 

Asp Leu Lys Arg Arg Leu Ala Asp Ala Ala Gin Arg Ser Val Gly Pro 

340 345 350 

Ala Gly Phe Trp Glu Ser Asp Asp Asn Glu Asn Met Glu Thr Leu Ser 
355 360 365 

25 Gin Asn Ala Lys Lys Tyr Gin Ser Ser Asn Ser Asp Gin He Ala Ser 
370 375 380 

Leu Gly Phe Gly Lys Asp Val Tyr Gly Asp Glu Cys Tyr Pro Gly Val 
385 390 395 400 

Val Gly Lys Ser Ala He Gly Glu Thr Ser Tyr Arg Gly Phe Tyr Arg 
30 405 410 415 

Ala Tyr Gin Ala His He Ser Ser Ser Asn Trp Ala Glu Phe Glu Asn 

420 425 430 

Ala Ser Arg Asn Trp His Thr Glu Leu Thr Lys Thr Thr Asp Arg 

435 440 445 

35 

<210> 24 
<211> 451 
<212> PRT 

<213> Artificial Sequence 

40 



60 

<220> 

<223> A polypeptide encoded by SEQ ID NO: 13 



<400> 24 

5 Met Arg Gin Ala lie Met Ser Tyr Gin Asn Leu Val Ser Glu Ala Gly 
15 10 15 

Leu Thr Gin Lys His Leu lie Tyr Gly Asp hys Glu Leu Phe Gin His 

20 25 30 

Glu Leu Lys Thr He Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp 
10 35 40 45 

Ser Leu He Pro Ser Pro Gly Asp Tyr Val Lys Ala Lys Met Gly Val 

50 55 60 

Asp Glu Val He Val Ser Arg Gin Asn Asp Gly Ser Val Arg Ala Phe 
65 70 75 80 

15 Leu Asn Val Cys Arg His Arg Gly Lys Thr He Val Asp Ala Glu Ala 
85 90 95 

Gly Asn Ala Lys Gly Phe Val Cys Gly Tyr His Gly Trp Gly Tyr Gly 

100 105 110 

Ser Asn Gly Glu Leu Gin Ser Val Pro Phe Glu Lys Glu Leu Tyr Gly 
20 115 120 125 

Asp Ala He Lys Lys Lys Cys Leu Gly Leu Lys Glu Val Pro Arg He 

130 135 140 

Glu Ser Phe His Gly Phe He Tyr Gly Cys Phe Asp Ala Glu Ala Pro 
145 150 155 160 

25 Pro Leu He Asp Tyr Leu Gly Asp Val Ala Trp Tyr Leu Glu Pro Thr 
165 170 175 

Phe Lys His Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Ala Lys Val 

180 185 190 

Val Val Lys Gly Asn Trp Lys Val Phe Ala Glu Asn Phe Val Gly Asp 
30 195 200 205 

He Tyr His He Gly Trp Thr His Ala Ser He Leu Arg Ala Gly Gin 

210 215 220 

Ala He Phe Ala Pro Leu Ala Gly Asn Ala Met Leu Pro Pro Glu Gly 
225 230 235 240 

35 Thr Gly Leu Gin Ala Thr Thr Lys Tyr Gly Ser Gly lie Gly Val Ser 
245 250 255 

Leu Asp Ala Tyr Ser Gly Val Gin Ser Ala Asp Leu Val Pro Glu Met 

260 265 270 

Met Ala Phe Gly Gly Ala Lys Gin Glu Lys Leu Ala Lys Glu He Gly 
40 275 280 285 



61 



Asp Val Arg Ala Arg 
290 

Pro Asn Asn Cys Phe 
305 

5 Pro He Asp Glu Asn 

325 

Lys Asp Met Pro Glu 
340 

Arg Ser Val Gly Pro 
10 355 

Met Val Leu Ser Gin 
370 

Leu He Ala Asp Leu 
385 

15 Tyr Pro Gly Val Val 
405 

Gly Phe Tyr Arg Ala 
420 

Glu Phe Glu Asn Thr 
20 435 
Thr Asp Arg 
450 



He Tyr Arg Ser Gin 
295 

Leu Thr Gly Ala Gly 
310 

Thr Thr Glu Ala Trp 
330 

Asp Leu Lys Arg Arg 
345 

Ala Gly Tyr Trp Glu 
360 

Asn Ala Lys Lys Tyr 
375 

Gly Phe Gly Lys Asp 
390 

Ser Lys Ser Ala Phe 
410 

Tyr Gin Ala His He 
425 

Ser Arg Asn Trp His 
440 



Val Asn Gly Thr Val Phe 
300 

Val Phe Lys Val Phe Asn 
315 320 
Thr Tyr Ala He Val Glu 
335 

Leu Ala Asp Ala Ala Gin 
350 

Ser Asp Asp Asn Asp Asn 
365 

Gin Ser Ser Asn Ser Asp 
380 

Val Tyr Gly Asp Glu Cys 
395 400 
Ser Glu Thr Asn His Arg 
415 

Ser Ser Ser Asn Trp Ala 
430 

Thr Glu Leu Thr Lys Thr 
445 



<210> 25 
25 <211> 2265 

<212> DNA 

<213> Pseudomonas sp. 



<400> 25 

30 gagggtagag aaatcgaatg ccccttgcat 
gccctgtgcg cacccgtgac acagaacatc 
cgcgtaatga ttgatttgag ctaagaattt 
aatcaccccc attccatctt ttttaggtga 
aagtgaatct ggtctgagcc aaaagcacct 

35 tgaactgaaa accatttttg cgcggaactg 
tgcccccggc gactatgtta ccgcaaaaat 
gaacgacggt tcgattcgtg cttttctgaa 
gagcgtggaa gccggcaatg ccaaaggttt 
ctccaacggt gaactgcaga gcgttccatt 

40 taaaaaatgt ctggggttga aagaagtcgc 



caaggtcggt 


ttgacgtttg 


cacaggcaaa 


60 


aaaacatatc 


cagtcaagat 


tgagaacctg 


120 


taacaggagg 


caccccgggc 


cctagagcgt 


180 


aaacatgaat 


tacaataata 


aaatcttggt 


240 


gattcatggc 


gatgaagaac 


ttttccaaca 


300 


gctttttctc 


actcatgata 


gcctgattcc 


360 


gggsattgac 


gaggtcatcg 


tctcccggca 


420 


cgtttgccgg 


catcgtggca 


agacgctggt 


480 


tgtttgcagc 


tatcacggct 


ggggcttcgg 


540 


tgaaaaagat 


ctgtacggcg 


agtcgctcaa 


600 


tcgcgtggag 


agcttccatg 


gcttcatcta 


660 



62 





cggttgcttc 


gaccaggagg 


cccctcctct 


tatggactat 


ctgggtgacg 


ctgcttggta 


720 




cctggaacct 


atgttcaagc 


attccggcgg 


tttagaactg 


gtcggtcctc caggcaaggt 


780 




tgtgatcaag gccaactgga 


aggcacccgc 


ggaaaacttt 


gtgggagatg 


cataccacgt 


840 




gggttggacg 


cacgcgtctt 


cgcttcgctc 


gggggagtct 


atcttctcgt 


cgctcgctgg 


900 


5 


caatgcggcg 


ctaccacctg 


aaggcgcagg 


cttgcaaatg 


acctccaaat 


acggcagcgg 


960 




catgggtgtg 


ttgtgggacg gatattcagg 


tgtgcatagc 


gcagacttgg 


ttccggaatt 


1020 




gatggcattc 


ggaggcgcaa 


agcaggaaag 


gctgaacaaa 


gaaattggcg 


atgttcgcgc 


1080 




tcggatttat 


cgcagccacc 


tcaactgcac 


cgttttcccg 


aacaacagca 


tgctgacctg 


1140 




ctcgggtgtt 


ttcaaagtat 


ggaacccgat 


cgacgcaaac 


accaccgagg 


tctggaccta 


1200 


10 


cgccattgfcc gaaaaagaca tgcctgagga 


tctcaagcgc 


cgcttggccg 


actctgttca 


1260 




gcgaacgttc 


gggcctgctg 


gcttctggga 


aagcgacgac 


aatgacaata 


tggaaacagc 


1320 




ttcgcaaaac ggcaagaaat 


atcaatcaag 


agatagtgat 


ctgctttcaa accttggttt 


1380 




cggtgaggac gtatacggcg acgcggtcta 


tccaggcgtc 


gtcggcaaat 


cggcgatcgg 


1440 




cgagaccagt 


tatcgtggtt 


tctaccgggc 


ttaccaggca 


cacgtcagca gctccaactg 


1500 


15 


ggctgagttc 


gagcatgcct 


ctagtacttg 


gcatactgaa 


cttacgaaga ctactgatcg 


1560 




ctaacagacg 


agtcgaccat 


gatgatcaat 


attcaagaag 


acaagctggt ttccgcccac 


1620 




gacgccgaag 


agattcttcg 


tttcttcaat 


tgccacgact 


ctgctttgca acaagaagcc 


1680 




actacgctgc 


tgacccagga 


agcgcatttg 


ttggacattc 


aggcttaccg 


tgcttggtta 


1740 




gagcactgcg 


tggggtcaga 


ggtgcaatat 


caggtcattt 


cacgcgaact 


gcgcgcagct 


1800 


20 


tcagagcgtc 


gttataagct 


caatgaagcc 


atgaacgttt 


acaacgaaaa 


ttttcagcaa 


1860 




ctgaaagttc 


gagttgagca 


tcaactggat 


ccgcaaaact 


Qgggcaacag cccgaagctg 


1920 




cgctttactc 


gctfctatcac 


caacgtccag 


gccgcaatgg 


acgtaaatga 


caaagagcta 


1980 




cttcacatcc 


gctccaacgt 


cattctgcac 


cgggcacgac 


gtggcaatca ggtcgatgtc 


2040 




ttctacgccg 


cccgggaaga 


taaatggaaa 


cgtggcgaag 


gtggagtacg 


aaaattggtc 


2100 


25 


cagcgattcg 


tcgattaccc 


agagcgcata 


cttcagacgc 


acaatctgat ggtctttctg 


2X60 




tgattcagtg 


accattttta 


caaatggtca 


ctgcaaccgc 


ggtcaccatt 


aatcaaaggg 


2220 




aatgtacgtg 


tatgggcaat 


caacaagtcg 


tttcgataac 


cggtg 




2265 



<210> 26 
30 <211> 449 

<212> PRT 

<213> Pseudomonas sp. 



<400> 26 

35 Met Asn Tyr Asn Asn Lys lie Leu 
1 5 
Lys His Leu He His Gly Asp Glu 
20 

Thr He Phe Ala Arg Asn Trp Leu 
40 35 40 



Val Ser Glu Ser Gly Leu Ser Gin 

10 15 
Glu Leu Phe Gin His Glu Leu Lys 

25 30 

Phe Leu Thr His Asp Ser Leu He 
45 



63 

Pro Ala Pro Gly Asp Tyr Val Thr Ala Lys Met Gly He Asp Glu Val 

50 55 60 

He Val Ser Arg Gin Asn Asp Gly Ser He Arg Ala Phe Leu Asn Val 
65 70 75 30 

5 Cys Arg His Arg Gly Lys Thr Leu Val Ser Val Glu Ala Gly Asn Ala 
85 90 95 

Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 

100 105 110 

Glu Leu Gin Ser Val Pro Phe Glu Lys Asp Leu Tyr Gly Glu Ser Leu 
10 115 120 125 

Asn Lys Lys Cys Leu Gly Leu Lys Glu Val Ala Arg Val Glu Ser Phe 

130 135 140 

His Gly Phe He Tyr Gly Cys Phe Asp Gin Glu Ala Pro Pro Leu Met 
145 150 155 160 

15 Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro Met Phe Lys His 
165 170 175 

Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val He Lys 

180 185 190 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His 
20 195 200 205 

Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Glu Ser He Phe 

210 215 220 

Ser Ser Leu Ala Gly Asn Ala Ala Leu Pro Pro Glu Gly Ala Gly Leu 
225 230 235 240 

25 Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 
245 250 255 

Tyr Ser Gly Val His Ser Ala Asp Leu Val Pro Glu Leu Met Ala Phe 

260 265 270 

Gly Gly Ala Lys Gin Glu Arg Leu Asn Lys Glu He Gly Asp Val Arg 
30 275 280 285 

Ala Arg He Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn 

290 295 300 

Ser Met Leu Thr Cys Ser Gly Val Phe Lys Val Trp Asn Pro He Asp 
305 310 315 320 

35 Ala Asn Thr Thr Glu Val Trp Thr Tyr Ala He Val Glu Lys Asp Met 
325 330 335 

Pro Glu Asp Leu Lys Arg Arg Leu Ala Asp Ser Val Gin Arg Thr Phe 

340 345 350 

Gly Pro Ala Gly Phe Trp Glu Ser Asp Asp Asn Asp Asn Met Glu Thr 
40 355 360 365 



Ala Ser Gin Asn Gly Lys Lys Tyr 

370 375 
Ser Asn Leu Gly Phe Gly Glu Asp 
385 390 
5 Gly Val Val Gly Lys Ser Ala He 
405 

Tyr Arg Ala Tyr Gin Ala His Val 
420 

Glu His Ala Ser Ser Thr Trp His 
10 435 440 

Arg 



64 

Gin Ser Arg Asp Ser Asp Leu Leu 
380 

Val Tyr Gly Asp Ala Val Tyr Pro 
395 400 
Gly Glu Thr Ser Tyr Arg Gly Phe 

410 415 
Ser Ser Ser Asn Trp Ala Glu Phe 

425 430 
Thr Glu Leu Thr Lys Thr Thr Asp 
445 



<210> 27 
15 <211> 2265 

<212> DNA 

<213> Artificial Sequence 
<220> 

20 <223> A modified DNA sequence 



<400> 27 





ga-gggtagag 


aaatcgaatg 


ccccttgcat 


caaggtcggt 


ttgacgtttg 


cacaggcaaa 


eo 




gccctgtgcg 


cacccgtgac 


acagaacatc 


aaaacatatc 


cagtcaagat 


tgagaacctg 


120 


25 


cgcgtaatga 


ttgatttgag 


ctaagaattt 


taacaggagg 


caccccgggc 


cctagagcgt 


180 




aatcaccccc 


attccatctt 


ttttaggtga 


aaacatgaat 


tacaataata 


aaatcttggt 


240 




aagtgaatct 


ggtctgagcc 


aaaagcacct 


gattcatggc 


gatgaagaac 


ttttccaaca 


300 




tgaactgaaa 


accatttttg 


cgcggaactg 


gctttttctc 


actcatgata 


gcctgattcc 


360 




tgcccccggc 


gactatgtta 


ccgcaaaaat 


g999^ttgac 


gaggtcatcg 


tctcccggca 


420 


30 gaacgacggt 


tcgattcgtg 


cttttctgaa 


cgtttgccgg 


catcgtggca 


agacgctggt 


480 




gagcgtggaa 


gccggcaatg 


ccaaaggttt 


tgtttgcagc 


tatcacggct 


ggggcttcgg 


540 




ctccaacggt 


gaactgcaga 


gcgttccatt 


tgaaaaagat 


ctgtacggcg 


agtcgctcaa 


600 




taaaaaatgt 


ctggggttga 


aagaagtcgc 


tcgcgtggag 


agcttccatg 


gcttcatcta 


660 




cggttgcttc 


gaccaggagg 


cccctcctct 


tatggactat 


ctgggtgacg 


ctgcttggta 


720 


35 


cctggaacct 


atgttcaagc 


attccggcgg 


tttagaactg 


gtcggtcctc 


caggcaaggt 


780 




tgtgatcaag 


gccaactgga 


aggcacccgc 


ggaaaacttt 


gtgggagatg 


cataccacgt 


840 




gggttggacg 


cacgcgtctt 


cgcttcgctc 


gggggagtct 


atcttctcgt 


cgctcgctgg 


900 




caatgcggcg 


ctaccacctg 


aaggcgcagg 


cttgcaaatg 


acctccaaat 


acggcagcgg 


960 




catgggtgtg 


ttgtgggacg 


gatattcagg 


tgtgcatagc 


gcagacttgg 


ttccggaatt 


1020 


40 gatggcattc 


ggaggcgcaa 


agcaggaaag 


gctgaacaaa 


gaaattggcg 


atgttcgcgc 


1080 



65 



tcggatttat cgcagccacc tcaactgcac cgttttcccg aacaacagca tgctgacctg 1140 

ctcgggtgtt ttcaaagtat ggaacccgat cgacgcaaac accaccgagg tctggaccta 1200 

cgccattgtc gaaaaagaca tgcctgagga tctcaagcgc cgcttggccg actctgttca 1260 

gcgaacgggc gggcctgctg gcttctggga aagcgacgac aatgacaata tggaaacagc 1320 

5 ttcgcaaaac ggcaagaaat atcaatcaag agatagtgat ctgctttcaa accttggttt 1380 

cggtgaggac gtatacggcg acgcggtcta tccaggcgtc gtcggcaaat cggcgatcgg 1440 

cgagaccagt tatcgtggtt tctaccgggc ttaccaggca cacgtcagca gctccaactg 1500 

ggctgagttc gagcatgcct ctagtacttg gcatactgaa cttacgaaga ctactgatcg 1560 

ctaacagacg agtcgaccat gatgatcaat attcaagaag acaagctggt ttccgcccac 1620 

10 gacgccgaag agattcttcg tttcttcaat tgccacgact ctgctttgca acaagaagcc 1680 

actacgctgc tgacccagga agcgcatttg ttggacattc aggcttaccg tgcttggtta 1740 

gagcactgcg tggggtcaga ggtgcaatat caggtcattt cacgcgaact gcgcgcagct 1800 

tcagagcgtc gttataagct caatgaagcc atgaacgttt acaacgaaaa ttttcagcaa 1860 

ctgaaagttc gagttgagca tcaactggat ccgcaaaact ggggcaacag cccgaagctg 1920 

15 cgctttactc gctttatcac caacgtccag gccgcaatgg acgtaaatga caaagagcta 1980 

cttcacatcc gctccaacgt cattctgcac cgggcacgac gtggcaatca ggtcgatgtc 2040 

ttctacgccg cccgggaaga fcaaatggaaa cgtggcgaag gtggagtacg aaaattggtc 2100 

cagcgattcg tcgattaccc agagcgcata cttcagacgc acaatctgat ggtctttctg 2160 

tgattcagtg accattttta caaatggtca ctgcaaccgc ggtcaccatt aatcaaaggg 2220 

20 aatgtacgtg tatgggcaat caacaagtcg tttcgataac cggtg 2265 

<210> 28 
<211> 2265 
<212> DNA 
25 <213> Artificial Sequence 

<220> 

<223> A modified DNA sequence 
30 <400> 28 

gagggtagag aaatcgaatg ccccttgcat caaggtcggt ttgacgtttg cacaggcaaa 60 

gccctgtgcg cacccgtgac acagaacatc aaaacatatc cagtcaagat tgagaacctg 120 

cgcgtaatga ttgatttgag ctaagaattt taacaggagg caccccgggc cctagagcgt 180 

aatcaccccc attccatctt ttttaggtga aaacatgaat tacaataata aaatcttggt 240 

35 aagtgaatct ggtctgagcc aaaagcacct gattcatggc gatgaagaac ttttccaaca 300 

tgaactgaaa accatttttg cgcggaactg gctttttctc actcatgata gcctgattcc 360 

tgcccccggc gactatgtta ccgcaaaaat ggggattgac gaggtcatcg tctcccggca 420 

gaacgacggt tcgattcgtg cttttctgaa cgtttgccgg catcgtggca agacgctggt 480 

gagcgtggaa gccggcaatg ccaaaggttt tgtttgcagc tatcacggct ggggcttcgg 540 

40 ctccaacggt gaactgcaga gcgttccatt tgaaaaagat ctgtacggcg agtcgctcaa 600 



66 





taaaaaatgt 


ctggggttga 


aagaagtcgc 


tcgcgtggag 


agcttccatg gcttcatcta 


660 




cggttgcttc 


gaccaggagg 


cccctcctct 


tatggactat 


ctgggtgacg ctgcttggta 


720 




cctggaacct 


atgttcaagc 


attccggcgg 


tttagaactg 


gtcggtcctc caggcaaggt 


780 




tgtgatcaag 


gccaactgga 


aggcacccgc 


ggaaaacttt 


gtgggagatg cataccacgt 


840 


5 


gggttggacg 


cacgcgtctt 


cgcttcgctc 


gggggagtct 


atcttctcgt cgctcgctgg 


900 




caatgcggcg 


ctaccacctg 


aaggcgcagg 


cttgcaaatg 


acctccaaat acggcagcgg 


960 




catgggtgtg 


ttgtgggacg 


gatattcagg 


tgtgcatagc 


gcagacttgg ttccggaatt 


1020 




gatggcattc 


ggaggcgcaa 


agcaggaaag 


gctgaacaaa 


gaaattggcg atgttcgcgc 


1080 




tcggatttat 


cgcagccacc 


tcaactgcac 


cgttttcccg 


aacaacagca tgctgacctg 


1140 


10 ctcgggtgtt 


ttcaaagtat 


ggaacccgat 


cgacgcaaac 


accaccgagg tctggaccta 


1200 




cgccattgtc 


gaaaaagaca 


tgcctgagga 


tctcaagcgc 


cgcttggccg actctgttca 


1260 




gcgaacggcc 


gggcctgctg 


gcttctggga 


aagcgacgac 


aatgacaata tggaaacagc 


1320 




ttcgcaaaac 


ggcaagaaat 


atcaatcaag 


agatagtgat 


ctgctttcaa accttggttt 


1380 




cggtgaggac 


gtatacggcg 


acgcggtcta 


tccaggcgtc 


gtcggcaaat cggcgatcgg 


1440 


15 


cgagaccagt 


tatcgtggtt 


tctaccgggc 


ttaccaggca 


cacgtcagca gctccaactg 


1500 




ggctgagttc 


gagcatgcct 


ctagtacttg 


gcatactgaa 


cttacgaaga ctactgatcg 


1560 




ctaacagacg 


agtcgaccat 


gatgatcaat 


attcaagaag 


acaagctggt ttccgcccac 


1620 




gacgccgaag 


agattcttcg 


tttcttcaat 


tgccacgact 


ctgctttgca acaagaagcc 


1680 




actacgctgc 


tgacccagga 


agcgcatttg 


ttggacattc 


aggcttaccg tgcttggtta 


1740 


20 


gagcactgcg 


tggggtcaga 


ggtgcaatat 


caggtcattt 


cacgcgaact gcgcgcagct 


1800 




tcagagcgtc 


gttataagct 


caatgaagcc 


atgaacgttt 


acaacgaaaa ttttcagcaa 


1860 




ctgaaagttc 


gagttgagca 


tcaactggat 


ccgcaaaact 


ggggcaacag cccgaagctg 


1920 




cgctttactc 


gctttatcac 


caacgtccag 


gccgcaatgg 


acgtaaatga caaagagcta 


1980 




cttcacatcc 


gctccaacgt 


cattctgcac 


cgggcacgac 


gtggcaatca ggtcgatgtc 


2040 


25 


ttctacgccg 


cccgggaaga 


taaatggaaa 


cgtggcgaag 


gtggagtacg aaaattggtc 


2100 




cagcgattcg 


tcgattaccc 


agagcgcata 


cttcagacgc 


acaatctgat ggtctttctg 


2160 




tgattcagtg 


accattttta 


caaatggtca 


ctgcaaccgc 


ggtcaccatt aatcaaaggg 


2220 




aatgtacgtg 


tatgggcaat 


caacaagtcg 


tttcgataac 


cggtg 


2265 



30 <210> 29 

<211> 2265 
<212> DNA 

<213> Artificial Sequence 

35 <220> 

<223> A modified DNA sequence 

<400> 29 

gagggtagag aaatcgaatg ccccttgcat caaggtcggt ttgacgtttg cacaggcaaa 60 
40 gccctgtgcg cacccgtgac acagaacatc aaaacatatc cagtcaagat tgagaacctg 120 



67 



cgcgtaatga ttgatttgag ctaagaattt taacaggagg caccccgggc cctagagcgt 180 

aatcaccccc attccatctt ttttaggtga aaacatgaat tacaataata aaatcttggt 240 

aagtgaatct ggtctgagcc aaaagcacct gattcatggc gatgaagaac ttttccaaca 300 

tgaactgaaa accatttttg cgcggaactg gctttttctc actcatgata gcctgattcc 360 

5 tgcccccggc gactatgtta ccgcaaaaat ggggattgac gaggtcatcg tctcccggca 420 

gaacgacggt tcgattcgtg cttttctgaa cgtttgccgg catcgtggca agacgctggt 480 

gagcgtggaa gccggcaatg ccaaaggttt tgtttgcagc tatcacggct ggggcttcgg 540 

ctccaacggt gaactgcaga gcgttccatt tgaaaaagat ctgtacggcg agtcgctcaa 600 

taaaaaatgt ctggggttga aagaagtcgc tcgcgtggag agcttccatg gcttcatcta 660 

10 cggttgcttc gaccaggagg cccctcctct tatggactat ctgggtgacg ctgcttggta 720 

cctggaacct atgttcaagc attccggcgg tttagaactg gtcggtcctc caggcaaggt 780 

tgtgatcaag gccaactgga aggcacccgc ggaaaacttt gtgggagatg cataccacgt 840 

gggttggacg cacgcgtctt cgcttcgctc gggggagtct atcttctcgt cgctcgctgg 900 

caatgcggcg ctaccacctg aaggcgcagg cttgcaaatg acctccaaat acggcagcgg 960 

15 catgggtgtg ttgtgggacg gatattcagg tgtgcatagc gcagacttgg ttccggaatt 1020 

gatggcattc ggaggcgcaa agcaggaaag gctgaacaaa gaaattggcg atgttcgcgc 1080 

tcggatttat cgcagccacc tcaactgcac cgttttcccg aacaacagca tgctgacctg 1140 

ctcgggtgtt ttcaaagtat ggaacccgat cgacgcaaac accaccgagg tctggaccta 1200 

cgccattgtc gaaaaagaca tgcctgagga tctcaagcgc cgcttggccg actctgttca 1260 

20 gcgaacgacc gggcctgctg gcttctggga aagcgacgac aatgacaata tggaaacagc 1320 

ttcgcaaaac ggcaagaaat atcaatcaag agatagtgat ctgctttcaa accttggttt 1380 

cggtgaggac gtatacggcg acgcggtcta tccaggcgtc gtcggcaaat cggcgatcgg 1440 

cgagaccagt tatcgtggtt tctaccgggc ttaccaggca cacgtcagca gctccaactg 1500 

ggctgagttc gagcatgcct ctagtacttg gcatactgaa cttacgaaga ctactgatcg 1560 

25 ctaacagacg agtcgaccat gatgatcaat attcaagaag acaagctggt ttccgcccac 1620 

gacgccgaag agattcttcg tttcttcaat tgccacgact ctgctttgca acaagaagcc 1680 

actacgctgc tgacccagga agcgcatttg ttggacattc aggcttaccg tgcttggtta 1740 

gagcactgcg tggggtcaga ggtgcaatat caggtcattt cacgcgaact gcgcgcagct 1800 

tcagagcgtc gttataagct caatgaagcc atgaacgttt acaacgaaaa ttttcagcaa 1860 

30 ctgaaagttc gagttgagca tcaactggat ccgcaaaact ggggcaacag cccgaagctg 1920 

cgctttactc gctttatcac caacgtccag gccgcaatgg acgtaaatga caaagagcta 1980 

cttcacatcc gctccaacgt cattctgcac cgggcacgac gtggcaatca ggtcgatgtc 2040 

ttctacgccg cccgggaaga taaatggaaa cgtggcgaag gtggagtacg aaaattggtc 2100 

cagcgattcg tcgattaccc agagcgcata cttcagacgc acaatctgat ggtctttctg 2160 

35 tgattcagtg accattttta caaatggtca ctgcaaccgc ggtcaccatt aatcaaaggg 2220 

aatgtacgtg tatgggcaat caacaagtcg tttcgataac cggtg 2265 



40 



<210> 30 
<211> 2265 
<212> DNA 



68 



<213> Artificial Sequence 
<220> 

<223> A modified 0NA sequence 

5 



<400> 30 





gagggtagag 


aaatcgaatg 


ccccttgcat 


caaggtcggt 


ttgacgtttg 


cacaggcaaa 


60 




gccctgtgcg 


cacccgtgac 


acagaacatc 


aaaacatatc 


cagtcaagat 


tgagaacctg 


120 




cgcgtaatga 


ttgatttgag 


ctaagaattt 


taacaggagg 


caccccgggc 


cctagagcgt 


180 


10 


aatcaccccc 


attccatctt 


ttttaggtga 


aaacatgaat 


tacaataata 


aaatcttggt 


240 




aagtgaatct 


ggtctgagcc 


aaaagcacct 


gattcatggc 


gatgaagaac 


ttttccaaca 


300 




tgaactgaaa 


accatttttg 


cgcggaactg 


gctttttctc 


actcatgata 


gcctgattcc 


360 




tgcccccggc 


gactatgtta 


ccgcaaaaat 


ggggattgac 


gaggtcatcg 


tctcccggca 


420 




gaacgacggt 


tcgattcgtg 


cttttctgaa 


cgtttgccgg 


catcgtggca 


agacgctggt 


480 


15 


gagcgtggaa 


gccggcaatg 


ccaaaggttt 


tgtttgcagc 


tatcacggct 


ggggcttcgg 


540 




ctccaacggt 


gaactgcaga 


gcgttccatt 


tgaaaaagat 


ctgtacggcg 


agtcgctcaa 


600 




taaaaaatgt 


ctggggttga 


aagaagtcgc 


tcgcgtggag 


agcttccatg 


gcttcatcta 


660 




cggttgcttc 


gaccaggagg 


cccctcctct 


tatggactat 


ctgggtgacg 


ctgcttggta 


720 




cctggaacct 


atgttcaagc 


attccggcgg 


tttagaactg 


gtcggtcctc 


caggcaaggt 


780 


20 


tgtgatcaag gccaactgga 


aggcacccgc 


ggaaaacttt 


gtgggagatg 


cataccacgt 


840 




gggttggacg 


cacgcgtctt 


cgcttcgctc 


gggggagtct 


atcttctcgt 


cgctcgctgg 


900 




caatgcggcg ctaccacctg 


aaggcgcagg 


cttgcaaatg 


acctccaaat 


acggcagcgg 


960 




catgggtgtg ttgtgggacg 


gatattcagg 


tgtgcatagc 


gcagacttgg 


ttccggaatt 


1020 




gatggcattc 


ggaggcgcaa 


agcaggaaag 


gctgaacaaa 


gaaattggcg 


atgttcgcgc 


1080 


25 


tcggatttat 


cgcagccacc 


tcaactgcac 


cgttttcccg 


aacaacagca 


tgctgacctg 


1140 




ctcgggtgtt 


ttcaaagtat 


ggaacccgat 


cgacgcaaac 


accaccgagg 


tctggaccta 


1200 




cgccattgtc 


gaaaaagaca 


tgcctgagga 


tctcaagcgc 


cgcttggccg 


actctgttca 


1260 




gcgaacgctc gggcctgctg 


gcttctggga 


aagcgacgac 


aatgacaata 


tggaaacagc 


1320 




ttcgcaaaac ggcaagaaat 


atcaatcaag 


agatagtgat 


ctgctttcaa 


accttggttt 


1380 


30 


cggtgaggac 


gtatacggcg 


acgcggtcta 


tccaggcgtc 


gtcggcaaat 


cggcgatcgg 


1440 




cgagaccagt 


tatcgtggtt 


tctaccgggc 


ttaccaggca 


cacgtcagca 


gctccaactg 


1500 




ggctgagttc gagcatgcct 


ctagtacttg 


gcatactgaa 


cttacgaaga 


ctactgatcg 


1560 




ctaacagacg 


agtcgaccat 


gatgatcaat 


attcaagaag 


acaagctggt 


ttccgcccac 


1620 




gacgccgaag 


agattcttcg 


tttcttcaat 


tgccacgact 


ctgctttgca 


acaagaagcc 


1680 


35 actacgctgc tgacccagga 


agcgcatttg 


ttggacattc 


aggcttaccg 


tgcttggtta 


1740 




gagcactgcg 


tggggtcaga 


ggtgcaatat 


caggtcattt 


cacgcgaact 


gcgcgcagct 


1800 




tcagagcgtc gttataagct 


caatgaagcc 


atgaacgttt 


acaacgaaaa 


ttttcagcaa 


1860 




ctgaaagttc 


gagttgagca 


tcaactggat 


ccgcaaaact 


ggggcaacag 


cccgaagctg 


1920 




cgctttactc 


gctttatcac 


caacgtccag 


gccgcaatgg 


acgtaaatga 


caaagagcta 


1980 


40 


cttcacatcc 


gctccaacgt 


cattctgcac 


cgggcacgac 


gtggcaatca 


ggtcgatgtc 


2040 



69 



ttctacgccg cccgggaaga taaatggaaa cgtggcgaag gtggagtacg aaaattggtc 2100 

cagcgattcg tcgattaccc agagcgcata cttcagacgc acaatctgat ggtctttctg 2160 

tgattcagtg accattttta caaatggtca ctgcaaccgc ggtcaccatt aatcaaaggg 2220 

aatgtacgtg tatgggcaat caacaagtcg tttcgataac cggtg 2265 

<210> 31 
<211> 2265 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> A modified DNA sequence 



<400> 31 



15 


gagggtagag 


aaatcgaatg 


ccccttgcat 


caaggtcggt 


ttgacgtttg 


cacaggcaaa 


60 




gccctgtgcg 


cacccgtgac 


acagaacatc 


aaaacatatc 


cagtcaagat 


tgagaacctg 


120 




cgcgtaatga 


ttgatttgag 


ctaagaattt 


taacaggagg 


caccccgggc 


cctagagcgt 


180 




aatcaccccc 


attccatctt 


ttttaggtga 


aaacatgaat 


tacaataata 


aaatcttggt 


240 




aagtgaatct 


ggtctgagcc 


aaaagcacct 


gattcatggc 


gatgaagaac 


ttttccaaca 


300 


20 


tgaactgaaa 


accatttttg 


cgcggaactg 


gctttttctc 


actcatgata 


gcctgattcc 


360 




tgcccccggc 


gactatgtta 


ccgcaaaaat 


ggggattgac 


gaggtcatcg 


tctcccggca 


420 




gaacgacggt 


tcgattcgtg 


cttttctgaa 


cgtttgccgg 


catcgtggca 


agacgctggt 


480 




gagcgtggaa 


gccggcaatg 


ccaaaggttt 


tgtttgcagc 


tatcacggct 


ggggcttcgg 


540 




ctccaacggt 


gaactgcaga 


gcgttccatt 


tgaaaaagat 


ctgtacggcg 


agtcgctcaa 


600 


25 


taaaaaatgt 


ctggggttga 


aagaagtcgc 


tcgcgtggag 


agcttccatg gcttcatcta 


660 




csgttgcttc 


gaccaggagg 


cccctcctct 


tatggactat 


ctgggtgacg ctgcttggta 


720 




cctggaacct 


atgttcaagc 


attccggcgg 


tttagaactg 


gtcggtcctc caggcaaggt 


780 




tgtgatcaag 


gccaactgga 


aggcacccgc 


ggaaaacttt 


gtgggagatg 


cataccacgt 


840 




gggttggacg 


cacgcgtctt 


cgcttcgctc 


gggggagtct 


atcttctcgt 


cgctcgctgg 


900 


30 


caatgcggcg 


ctaccacctg 


aaggcgcagg 


cttgcaaatg 


acctccaaat 


acggcagcgg 


960 




catgggtgtg 


ttgtgggacg 


gatattcagg 


tgtgcatagc 


gcagacttgg 


ttccggaatt 


1020 




gatggcattc 


ggaggcgcaa 


agcaggaaag 


gctgaacaaa 


gaaattggcg atgttcgcgc 


1080 




tcggatttat 


cgcagccacc 


tcaactgcac 


cgttttcccg 


aacaacagca 


tgctgacctg 


1140 




ctcgggtgtt 


ttcaaagtat 


ggaacccgat 


cgacgcaaac 


accaccgagg tctggaccta 


1200 


35 


cgccattgtc 


gaaaaagaca 


tgcctgagga 


tctcaagcgc 


cgcttggccg 


actctgttca 


1260 




gcgaacgatc 


gggcctgctg 


gcttctggga 


aagcgacgac 


aatgacaata 


tggaaacagc 


1320 




ttcgcaaaac 


ggcaagaaat 


atcaatcaag 


agatagtgat 


ctgctttcaa 


accttggttt 


1380 




cggtgaggac 


gtatacggcg 


acgcggtcta 


tccaggcgtc 


gtcggcaaat 


cggcgatcgg 


1440 




cgagaccagt 


tatcgtggtt 


tctaccgggc 


ttaccaggca 


cacgtcagca gctccaactg 


1500 


40 


ggctgagttc 


gagcatgcct 


ctagtacttg 


gcatactgaa 


cttacgaaga ctactgatcg 


1560 



70 





ctaacagacg agtcgaccat 


gatgatcaat 


attcaagaag 


acaagctggt 


ttccgcccac 


1620 




gacgccgaag agattcttcg 


tttcttcaat 


tgccacgact 


ctgctttgca 


acaagaagcc 


1680 




actacgctgc tgacccagga 


agcgcatttg 


ttggacattc 


aggcttaccg 


tgcttggtta 


1740 




gagcactgcg tggggtcaga 


ggtgcaatat 


caggtcattt 


cacgcgaact 


gcgcgcagct 


1800 


5 


tcagagcgtc gttataagct 


caatgaagcc 


atgaacgttt 


acaacgaaaa 


ttttcagcaa 


1860 




ctgaaagttc gagttgagca 


tcaactggat 


ccgcaaaact 




cccgaagctg 


1920 




cgctttactc gctttatcac 


caacgtccag 


gccgcaatgg 


acgtaaatga 


caaagagcta 


1980 




cttcacatcc gctccaacgt 


cattctgcac 


cgggcacgac 


gtggcaatca 


ggtcgatgtc 


2040 




ttctacgccg cccgggaaga 


taaatggaaa 


cgtggcgaag 


gtggagtacg aaaattggtc 


2100 


10 


cagcgattcg tcgattaccc 


agagcgcata 


cfctcagacgc 


acaatctgat 


ggtctttctg 


2160 




tgattcagtg accattttta 


caaatggtca 


ctgcaaccgc 


ggtcaccatt 


aatcaaaggg 


2220 




aatgtacgtg tatgggcaat 


caacaagtcg 


tttcgataac 


cggtg 




2265 



<210> 32 
15 <211> 449 

<212> PRT 

<213> Artificial Sequence 
<220> 

20 <223> A polypeptide encoded by SEQ ID NO: 27 

<400> 32 

Met Asn Tyr Asn Asn Lys He Leu Val Ser Glu Ser Gly Leu Ser Gin 
15 10 15 

25 Lys His Leu He His Gly Asp Glu Glu Leu Phe Gin His Glu Leu Lys 
20 25 30 

Thr He Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu He 

35 40 45 

Pro Ala Pro Gly Asp Tyr Val Thr Ala Lys Met Gly He Asp Glu Val 
30 50 55 60 

He Val Ser Arg Gin Asn Asp Gly Ser He Arg Ala Phe Leu Asn Val 
65 70 75 80 

Cys Arg His Arg Gly Lys Thr Leu Val Ser Val Glu Ala Gly Asn Ala 
85 90 95 

35 Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 
100 105 110 

Glu Leu Oln Ser Val Pro Phe Glu Lys Asp Leu Tyr Gly Glu Ser Leu 

115 120 125 

Asn Lys Lys Cys Leu Gly Leu Lys Glu Val Ala Arg Val Glu Ser Phe 
40 130 135 140 



71 

His Gly Phe lie Tyr Gly Cys Phe Asp Gin Glu Ala Pro Pro Leu Met 
145 150 155 160 

Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro Met Phe Lys His 
165 170 175 

5 Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val lie Lys 
180 185 190 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His 

195 200 205 

Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Glu Ser lie Phe 
10 210 215 220 

Ser Ser Leu Ala Gly Asn Ala Ala Leu Pro Pro Glu Gly Ala Gly Leu 

225 230 235 240 

Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 
245 250 255 

15 Tyr Ser Gly Val His Ser Ala Asp Leu Val Pro Glu Leu Met Ala Phe 
260 265 270 

Gly Gly Ala Lys Gin Glu Arg Leu Asn Lys Glu lie Gly Asp Val Arg 

275 280 285 

Ala Arg lie Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn 
20 290 295 300 

Ser Met Leu Thr Cys Ser Gly Val Phe Lys Val Trp Asn Pro lie Asp 
305 310 315 320 

Ala Asn Thr Thr Glu Val Trp Thr Tyr Ala lie Val Glu Lys Asp Met 
325 330 335 

25 Pro Glu Asp Leu Lys Arg Arg Leu Ala Asp Ser Val Gin Arg Thr Gly 
340 345 350 

Gly Pro Ala Gly Phe Trp Glu Ser Asp Asp Asn Asp Asn Met Glu Thr 

355 360 365 

Ala Ser Gin Asn Gly Lys Lys Tyr Gin Ser Arg Asp Ser Asp Leu Leu 
30 370 375 380 

Ser Asn Leu Gly Phe Gly Glu Asp Val Tyr Gly Asp Ala Val Tyr Pro 
385 390 395 400 

Gly Val Val Gly Lys Ser Ala lie Gly Glu Thr Ser Tyr Arg Gly Phe 
405 410 415 

35 Tyr Arg Ala Tyr Gin Ala His Val Ser Ser Ser Asn Trp Ala Glu Phe 
420 425 430 

Glu His Ala Ser Ser Thr Trp His Thr Glu Leu Thr Lys Thr Thr Asp 
435 440 445 

Arg 

40 
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<210> 33 
<211> 449 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> A polypeptide encoded by SEQ ID NO: 28 



<400> 33 

10 Met Asn Tyr Asn Asn Lys He Leu Val Ser Glu Ser Gly Leu Ser Gin 
15 10 15 

Lys His Leu He His Gly Asp Glu Glu Leu Phe Gin His Glu Leu Lys 

20 25 30 

Thr He Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu He 
15 35 40 45 

Pro Ala Pro Gly Asp Tyr Val Thr Ala Lys Met Gly He Asp Glu Val 

50 55 60 

He Val Ser Arg Gin Asn Asp Gly Ser He Arg Ala Phe Leu Asn Val 
65 70 75 80 

20 Cys Arg His Arg Gly Lys Thr Leu Val Ser Val Glu Ala Gly Asn Ala 
85 SO 95 

Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 

100 105 110 

Glu Leu Gin Ser Val Pro Phe Glu Lys Asp Leu Tyr Gly Glu Ser Leu 
25 115 120 125 

Asn Lys Lys Cys Leu Gly Leu Lys Glu Val Ala Arg Val Glu Ser Phe 

130 135 140 

His Gly Phe He Tyr Gly Cys Phe Asp Gin Glu Ala Pro Pro Leu Met 
145 ISO 155 160 

30 Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro Met Phe Lys His 
165 170 175 

Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val He Lys 

180 185 190 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His 
35 195 200 205 

Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Glu Ser He Phe 

210 215 220 

Ser Ser Leu Ala Gly Asn Ala Ala Leu Pro Pro Glu Gly Ala Gly Leu 

225 230 235 240 

40 Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 



245 

Tyr Ser Gly Val His 

260 

Gly Gly Ala Lys Gin 
5 275 
Ala Arg He Tyr Arg 
290 

Ser Met Leu Thr Cys 
305 

10 Ala Asn Thr Thr Glu 
325 

Pro Glu Asp Leu Lys 
340 

Gly Pro Ala Gly Phe 
15 355 

Ala Ser Gin Asn Gly 
370 

Ser Asn Leu Gly Phe 
385 

20 Gly Val Val Gly Lys 
405 

Tyr Arg Ala Tyr Gin 
420 

Glu His Ala Ser Ser 
25 435 
Arg 
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250 

Ser Ala Asp Leu Val 
265 

Glu Arg Leu Asn Lys 
280 

Ser His Leu Asn Cys 
295 

Ser Gly Val Phe Lys 
310 

Val Trp Thr Tyr Ala 
330 

Arg Arg Leu Ala Asp 

345 

Trp Glu Ser Asp Asp 
360 

Lys Lys Tyr Gin Ser 
375 

Gly Glu Asp Val Tyr 
390 

Ser Ala He Gly Glu 
410 

Ala His Val Ser Ser 
425 

Thr Trp His Thr Glu 
440 



255 

Pro Glu Leu Met Ala Phe 
270 

Glu He Gly Asp Val Arg 
285 

Thr Val Phe Pro Asn Asn 
300 

Val Trp Asn Pro He Asp 
315 320 
He Val Glu Lys Asp Wet 
335 

Ser Val Gin Arg Thr Ala 
350 

Asn Asp Asn Met Glu Thr 
365 

Arg Asp Ser Asp Leu Leu 
380 

Gly Asp Ala Val Tyr Pro 
395 400 
Thr Ser Tyr Arg Gly Phe 
415 

Ser Asn Trp Ala Glu Phe 
430 

Leu Thr Lys Thr Thr Asp 
445 



<210> 34 
30 <2ai> 449 

<212> PRT 

<213> Artificial Sequence 
<220> 

35 <223> A polypeptide encoded by SEQ ID NO: 29 

<400> 34 

Met Asn Tyr Asn Asn Lys He Leu Val Ser Glu Ser Gly Leu Ser Gin 
X 5 10 15 

40 Lys His Leu He His Gly Asp Glu Glu Leu Phe Gin His Glu Leu Lys 
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20 25 30 

Thr He Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu He 

35 40 45 

Pro Ala Pro Gly Asp Tyr Val Thr Ala Lys Met Gly He Asp Glu Val 
5 50 55 60 

lie Val Ser Arg Gin Asn Asp Gly Ser He Arg Ala Phe Leu Asn Val 
65 70 75 80 

Cys Arg His Arg Gly Lys Thr Leu Val Ser Val Glu Ala Gly Asn Ala 
85 50 55 

10 Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 
100 105 HO 

Glu Leu Gin Ser Val Pro Phe Glu Lys Asp Leu Tyr Gly Glu Ser Leu 

115 120 125 

Asn Lys Lys Cys Leu Gly Leu Lys Glu Val Ala Arg Val Glu Ser Phe 
15 130 135 140 

His Gly Phe He Tyr Gly Cys Phe Asp Gin Glu Ala Pro Pro Leu Met 
145 150 155 160 

Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro Met Phe Lys His 
165 170 175 

20 Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val He Lys 
180 185 190 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His 

195 200 205 

Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Glu Ser He Phe 
25 210 215 220 

Ser Ser Leu Ala Gly Asn Ala Ala Leu Pro Pro Glu Gly Ala Gly Leu 
225 230 235 240 

Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 
245 250 255 

30 Tyr Ser Gly Val His Ser Ala Asp Leu Val Pro Glu Leu Met Ala Phe 
260 265 270 

Gly Gly Ala Lys Gin Glu Arg Leu Asn Lys Glu He Gly Asp Val Arg 

275 280 285 

Ala Arg He Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn 
35 290 295 300 

Ser Met Leu Thr Cys Ser Gly Val Phe Lys Val Trp Asn Pro He Asp 
305 310 315 320 

Ala Asn Thr Thr Glu Val Trp Thr Tyr Ala He Val Glu Lys Asp Met 
325 330 335 

40 Pro Glu Asp Leu Lys Arg Arg Leu Ala Asp Ser Val Gin Arg Thr Thr 
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340 345 350 

Gly Pro Ala Gly Phe Trp Glu Ser Asp Asp Asn Asp Asn Met Glu Thr 

355 360 365 

Ala Ser Gin Asn Gly Lys Lys Tyr Gin Ser Arg Asp Ser Asp Leu Leu 
5 370 375 380 

Ser Asn Leu Gly Phe Gly Glu Asp Val Tyr Gly Asp Ala Val Tyr Pro 
385 390 395 400 

Gly Val Val Gly Lys Ser Ala He Gly Glu Thr Ser Tyr Arg Gly Phe 
405 410 415 

10 Tyr Arg Ala Tyr Gin Ala His Val Ser Ser Ser Asn Trp Ala Glu Phe 
420 425 430 

Glu His Ala Ser Ser Thr Trp His Thr Glu Leu Thr Lys Thr Thr Asp 
435 440 445 

Arg 

15 

<210> 35 
<211> 449 
<212> PRT 
20 <213> Artificial Sequence 

<220> 

<223> A polypeptide encoded by SEQ ID NO: 30 

25 <400> 35 

Met Asn Tyr Asn Asn Lys He Leu Val Ser Glu Ser Gly Leu Ser Gin 

15 10 15 

Lys His Leu He His Gly Asp Glu Glu Leu Phe Gin His Glu Leu Lys 
20 25 30 

30 Thr He Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu He 
35 40 45 

Pro Ala Pro Gly Asp Tyr Val Thr Ala Lys Met Gly He Asp Glu Val 

50 55 60 

He Val Ser Arg Gin Asn Asp Gly Ser He Arg Ala Phe Leu Asn Val 
35 65 70 75 80 

Cys Arg His Arg Gly Lys Thr Leu Val Ser Val Glu Ala Gly Asn Ala 

85 90 95 

Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 
100 105 110 

40 Glu Leu Gin Ser Val Pro Phe Glu Lys Asp Leu Tyr Gly Glu Ser Leu 
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115 120 125 

Asn Lys hys Cys Leu Gly Leu Lys Glu Val Ala Arg Val Glu Ser Phe 

130 135 140 

His Gly Phe lie Tyr Gly Cys Phe Asp Gin Glu Ala Pro Pro Leu Met 
5 145 150 155 160 

Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro Met Phe Lys His 

165 170 175 

Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val lie Lys 
180 185 190 

10 Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His 
195 200 205 

Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Glu Ser He Phe 

210 215 220 

Ser Ser Leu Ala Gly Asn Ala Ala Leu Pro Pro Glu Gly Ala Gly Leu 
15 225 230 235 240 

Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 

245 250 255 

Tyr Ser Gly Val His Ser Ala Asp Leu Val Pro Glu Leu Met Ala Phe 
260 265 270 

20 Gly Gly Ala Lys Gin Glu Arg Leu Asn Lys Glu He Gly Asp Val Arg 
275 280 285 

Ala Arg He Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn 

290 295 300 

Ser Met Leu Thr Cys Ser Gly Val Phe Lys Val Trp Asn Pro He Asp 
25 305 310 315 320 

Ala Asn Thr Thr Glu Val Trp Thr Tyr Ala He Val Glu Lys Asp Met 

325 330 335 

Pro Glu Asp Leu Lys Arg Arg Leu Ala Asp Ser Val Gin Arg Thr Leu 
340 345 350 

30 Gly Pro Ala Gly Phe Trp Glu Ser Asp Asp Asn Asp Asn Met Glu Thr 
355 360 365 

Ala Ser Gin Asn Gly Lys Lys Tyr Gin Ser Arg Asp Ser Asp Leu Leu 

370 375 380 

Ser Asn Leu Gly Phe Gly Glu Asp Val Tyr Gly Asp Ala Val Tyr Pro 
35 385 390 395 400 

Gly Val Val Gly Lys Ser Ala He Gly Glu Thr Ser Tyr Arg Gly Phe 

405 410 415 

Tyr Arg Ala Tyr Gin Ala His Val Ser Ser Ser Asn Trp Ala Glu Phe 
420 425 430 

40 Glu His Ala Ser Ser Thr Trp His Thr Glu Leu Thr Lys Thr Thr Asp 
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435 



440 



445 



Arg 



5 <210> 36 

<211> 449 
<212> PRT 

<213> Artificial Sequence 

10 <220> 

<223> A polypeptide encoded by SEQ ID NO: 31 

<400> 36 

Met Asn Tyr Asn Asn Lys lie Leu Val Ser Glu Ser Gly Leu Ser Gin 
15 1 5 10 15 

Lys His Leu lie His Gly Asp Glu Glu Leu Phe Gin His Glu Leu Lys 

20 25 30 

Thr lie Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu lie 
35 40 45 

20 Pro Ala Pro Gly Asp Tyr Val Thr Ala Lys Met Gly lie Asp Glu Val 
50 55 60 

lie Val Ser Arg Gin Asn Asp Gly Ser lie Arg Ala Phe Leu Asn Val 
65 70 75 SO 

Cys Arg His Arg Gly Lys Thr Leu Val Ser Val Glu Ala Gly Asn Ala 
25 85 90 95 

Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 

100 105 110 

Glu Leu Gin Ser Val Pro Phe Glu Lys Asp Leu Tyr Gly Glu Ser Leu 
115 120 125 

30 Asn Lys Lys Cys Leu Gly Leu Lys Glu Val Ala Arg Val Glu Ser Phe 
130 135 140 

His Gly Phe lie Tyr Gly Cys Phe Asp Gin Glu Ala Pro Pro Leu Met 
145 150 155 160 

Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro Met Phe Lys His 
35 165 170 175 

Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val lie Lys 

180 185 190 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His 
195 200 205 

40 Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Glu Ser He Phe 
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210 215 220 

Ser Ser Leu Ala Gly Asn Ala Ala Leu Pro Pro Glu Gly Ala Gly Leu 
225 230 235 240 

Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Vai Leu Trp Asp Gly 
5 245 250 255 

Tyr Ser Gly Val His Ser Ala Asp Leu Val Pro Glu Leu Met Ala Phe 

260 265 270 

Gly Gly Ala Lys Gin Glu Arg Leu Asn Lys Glu lie Gly Asp Val Arg 
275 280 285 

10 Ala Arg He Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn 
290 295 300 

Ser Met Leu Thr Cys Ser Gly Val Phe Lys Val Trp Asn Pro He Asp 
305 310 315 320 

Ala Asn Thr Thr Glu Val Trp Thr Tyr Ala He Val Glu Lys Asp Met 
15 325 330 335 

Pro Glu Asp Leu Lys Arg Arg Leu Ala Asp Ser Val Gin Arg Thr lie 

340 345 350 

Gly Pro Ala Gly Phe Trp Glu Ser Asp Asp Asn Asp Asn Met Glu Thr 
355 360 365 

20 Ala Ser Gin Asn Gly Lys Lys Tyr Gin Ser Arg Asp Ser Asp Leu Leu 
370 375 380 

Ser Asn Leu Gly Phe Gly Glu Asp Val Tyr Gly Asp Ala Val Tyr Pro 
385 390 395 400 

Gly Val Val Gly Lys Ser Ala He Gly Glu Thr Ser Tyr Arg Gly Phe 
25 405 410 415 

Tyr Arg Ala Tyr Gin Ala His Val Ser Ser Ser Asn Trp Ala Glu Phe 

420 425 430 

Glu His Ala Ser Ser Thr Trp His Thr Glu Leu Thr Lys Thr Thr Asp 
435 440 445 

30 Arg 



<210> 37 
<211> 22 
35 <212> DNA 

<213> Artificial Sequence 



40 



<220> 

<223> An oligonucleotide 



<400> 37 
ttcagcgaac ggtcgggcct gc 

<210> 38 
5 <211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

10 <223> An oligonucleotide 

<400> 38 
gccgggcctc ttgcgggata tcgtcca 

15 <210> 39 

<211> 27 
<212> DNA 

<213> Artificial Sequence 

20 <220> 

<223> An oligonucleotide 

<400> 39 
gttgccattg ctgcaggcat cgtggtg 

25 

<210> 40 
<211> 33 
<212> DNA 

<213> Artificial Sequence 

30 

<220> 

<223> An oligonucleotide 
<400> 40 

35 gaggcacccg cggaagcttt tgtgggagat gca 

<210> 41 
<211> 30 
<212> DNA 
40 <213> Artificial Sequence 



<220> 

<223> An oligonucleotide 



<400> 41 

5 gcacccgcgg aacaatttgt gggagatgca 

<210> 42 
<211> 21 
<212> DNA 
10 <213> Artificial Sequence 

<220> 

<223> An oligonucleotide 

15 <400> 42 

ccgcggaaag ctttgtggga g 

<210> 43 
<211> 24 
20 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> An oligonucleotide 

25 

<400> 43 
ccgcggaaaa gcttgtggga gatg 

<210> 44 
30 <211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

35 <223> An oligonucleotide 

<400> 44 
cgcggaaaac gttgtgggag atg 



40 <210> 45 



81 



<211> 23 
<212> DNA 

<213> Artificial Sequence 

5 <220> 

<223> An oligonucleotide 

<400> 45 
atattcaggt gcgcatagcg cag 

10 

<210> 46 
<211> 34 
<212> DNA 

<213> Artificial Sequence 

15 

<220> 

<223> An oligonucleotide 
<400> 46 

20 ggacggatat tcagggctcc atagcgcaga cttg 

<210> 47 
<211> 33 
<212> DHA 
25 <213> Artificial Sequence 

<220> 

<223> An oligonucleotide 

30 <400> 47 

gacggatatt caggtaacca tagcgcagac ttg 

<210> 48 
<211> 30 
35 <212> DNA 

<213> Artificial Sequence 



<220> 

<223> An oligonucleotide 

40 



<400> 48 

ggtgttttca aagtcgcgaa cccgatcgac 



<210> 49 
5 <211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

10 <223> An oligonucleotide 

<400> 49 
ctgttcagcg aaacttcggg cctgct 

15 <210> 50 

<211> 26 

<212> DNA 

<213> Artificial Sequence 

20 <220> 

<223> An oligonucleotide 

<400> 50 
ctgttcagcg aaggttcggg cctgct 

25 

<210> 51 
<211> 26 
<212> DNA 

<213> Artificial Sequence 

30 

<220> 

<223> An oligonucleotide 

<400> 51 
35 ctgttcagcg aagcttcggg cctgct 

<210> 52 
<211> 22 
<212> DNA 
40 <213> Artificial Sequence 



<220> 

<223> An oligonucleotide 



<400> 52 
5 ttcagcgaac gctcgggcct gc 

<210> 53 
<211> 30 
<212> DNA 
10 <213> Artificial Sequence 

<220> 

<223> An oligonucleotide 

15 <400> 53 

ggcctgctgg cttcgcggaa agcgacgaca 

<210> 54 
<211> 21 
20 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> An oligonucleotide 

25 

<400> 54 
gaaagcgacg ccaatgacaa t 

<210> 55 
30 <211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

35 <223> An oligonucleotide 

<400> 55 

acgacaatga caattgggaa acagcttcgc 



40 <210> 56 



84 

<211> 2265 
<212> DNA 

<213> Artificial Sequence 

5 <220> 

<223> A modified DNA sequence 



<400> 56 





gagggtagag aaatcgaatg 


ccccttgcat 


caaggtcggt 


ttgacgtttg 


cacaggcaaa 


60 


10 


gccctgtgcg cacccgtgac 


acagaacatc 


aaaacatatc 


cagtcaagat 


tgagaacctg 


120 




cgcgtaatga ttgatttgag 


ctaagaattt 


taacaggagg 


caccccgggc 


cctagagcgt 


180 




aatcaccccc 


attccatctt 


ttttaggtga 


aaacatgaat 


tacaataata 


aaatcttggt 


240 




aagtgaatct 


ggtctgagcc 


aaaagcacct 


gattcatggc 


gatgaagaac 


ttttccaaca 


300 




tgaactgaaa 


accatttttg 


cgcggaactg 


gctttttctc 


actcatgata 


gcctgattcc 


360 


15 


tgcccccggc 


gactatgtta 


ccgcaaaaat 


gggg^ttgac 


gaggtcatcg 


tctcccggca 


420 




gaacgacggt 


tcgattcgtg 


cttttctgaa 


cgtttgccgg catcgtggca 


agacgctggt 


480 




gagcgtggaa 


gccggcaatg 


ccaaaggttt 


tgtttgcagc 


tatcacggct ggggcttcgg 


540 




ctccaacggt 


gaactgcaga 


gcgttccatt 


tgaaaaagat 


ctgtacggcg 


agtcgctcaa 


600 




taaaaaatgt 


ctggggttga 


aagaagtcgc 


tcgcgtggag agcttccatg gcttcatcta 


660 


20 cggttgcttc 


gaccaggagg 


cccctcctct 


tatggactat 


ctgggtgacg ctgcttggta 


720 




cctggaacct 


atgttcaagc 


attccggcgg 


tttagaactg gtcggtcctc 


caggcaaggt 


780 




tgtgatcaag 


gccaactgga 


aggcacccgc 


ggaaaacttt 


gtgggagatg 


cataccacgt 


840 




gggttggacg 


cacgcgtctt 


cgcttcgctc 


gggggagtct 


atcttctcgt 


cgctcgctgg 


900 




caatgcggcg 


ctaccacctg 


aaggcgcagg 


cttgcaaatg 


acctccaaat 


acggcagcgg 


960 


25 


catgggtgtg 


ttgtgggacg 


gatattcagg 


tgtgcatagc 


gcagacttgg 


ttccggaatt 


1020 




gatggcattc 


ggaggcgcaa 


agcaggaaag 


gctgaacaaa gaaattggcg 


atgttcgcgc 


1080 




tcggatttat 


cgcagccacc 


tcaactgcac 


cgttttcccg aacaacagca tgctgacctg 


1140 




ctcgggtgtt 


ttcaaagtat 


ggaacccgat 


cgacgcaaac 


accaccgagg tctggaccta 


1200 




cgccattgtc gaaaaagaca 


tgcctgagga 


tctcaagcgc 


cgcttggccg actctgttca 


1260 


30 


gcgaacgtgg gggcctgctg 


gcttctggga 


aagcgacgac aatgacaata 


tggaaacagc 


1320 




ttcgcaaaac 


ggcaagaaat 


atcaatcaag 


agatagtgat 


ctgctttcaa 


accttggttt 


1380 




cggtgaggac 


gtatacggcg 


acgcggtcta 


tccaggcgtc 


gtcggcaaat 


cggcgatcgg 


1440 




cgagaccagt 


tatcgtggtt 


tctaccgggc 


ttaccaggca 


cacgtcagca 


gctccaactg 


1500 




ggctgagttc 


gagcatgcct 


ctagtacttg 


gcatactgaa 


cttacgaaga 


ctactgatcg 


1560 


35 


ctaacagacg 


agtcgaccat 


gatgatcaat 


attcaagaag acaagctggt 


ttccgcccac 


1620 




gacgccgaag agattcttcg 


tttcttcaat 


tgccacgact 


ctgctttgca 


acaagaagcc 


1680 




actacgctgc 


tgacccagga 


agcgcatttg 


ttggacattc 


aggcttaccg tgcttggtta 


1740 




gagcactgcg 


tggggtcaga 


ggtgcaatat 


caggtcattt 


cacgcgaact 


gcgcgcagct 


1800 




tcagagcgtc 


gttataagct 


caatgaagcc 


atgaacgttt 


acaacgaaaa 


ttttcagcaa 


1860 


40 ctgaaagttc gagttgagca 


tcaactggat 


ccgcaaaact ggggcaacag cccgaagctg 


1920 



85 



cgctttactc 


gctttatcac 


caacgtccag 


gccgcaatgg 


acgtaaatga 


caaagagcta 


1980 


cttcacatcc 


gctccaacgt 


cattctgcac 


cgggcacgac gtggcaatca ggtcgatgtc 


2040 


ttctacgccg 


cccgggaaga 


taaatggaaa 


cgtggcgaag 


gtggagtacg 


aaaattggtc 


2100 


cagcgattcg 


tcgattaccc 


agagcgcata 


cttcagacgc 


acaatctgat 


ggtctttctg 


2160 


5 tgattcagtg 


accattttta 


caaatggtca 


ctgcaaccgc 


ggtcaccatt 


aatcaaaggg 


2220 


aatgtacgtg 


tatgggcaat 


caacaagtcg 


tttcgataac 


cggtg 




2265 



<210> 57 
<211> 2265 
10 <212> mk 

<213> Artificial Sequence 

<220> 

<223> A modified DNA sequence 

15 

<400> 57 





9^99gtagag 


aaatcgaatg 


ccccttgcat 


caaggtcggt ttgacgtttg cacaggcaaa 


60 




gccctgtgcg 


cacccgtgac 


acaqaacatc 


aaaacatatc cagtcaagat tgagaacctg 


12 0 




cgcgtaatga 


ttgatttgag 


ctaagaattt 


taacaggagg caccccgggc cctagagcgt 


180 


20 


aatcaccccc 


attccatctt 


ttttaggtga 


aaacatgaat tacaataata aaatcttggt 


240 




sagtgaatct 


ggtctgagcc 


aaaagcacct 


gattcatggc gatgaagaac ttttccaaca 


300 




tgaactgaaa 


accatttttg 


cgcggaactg 


gctttttctc actcatgata gcctgattcc 


360 




tgcccccggc 


gactatgtta 


ccgcaaaaat 


ggggattgac gaggtcatcg tctcccggca 


420 




gaacgacggt 


tcgattcgtg 


cttttctgaa 


cgfcttgccgg catcgtggca agacgctggt 


480 


25 gagcgtggaa gccggcaatg 


ccaaaggttt 


tgtttgcagc tatcacggct ggggcttcgg 


540 




ctccaacggt 


gaactgcaga 


gcgttccatt 


tgaaaaagat ctgtacggcg agtcgctcaa 


600 




taaaaaatgt 


ctggggttga 


aagaagtcgc 


tcgcgtggag agcttccatg gcttcatcta 


660 




cggttgcttc 


gaccaggagg 


cccctcctct 


tatggactat ctgggtgacg ctgcttggta 


720 




cctggaacct 


atgttcaagc 


attccggcgg 


tttagaactg gtcggtcctc caggcaaggt 


780 


30 tgtgatcaag gccaactgga 


aggcacccgc 


ggaaaacttt gtgggagatg cataccacgt 


840 




gggttggacg 


cacgcgtctt 


cgcttcgctc 


gggggagtct atcttctcgt cgctcgctgg 


900 




caatgcggcg 


ctaccacctg 


aaggcgcagg 


cttgcaaatg acctccaaat acggcagcgg 


960 




catgggtgtg 


ttgtgggacg 


gatattcagg 


tgtgcatagc gcagacttgg ttccggaatt 


1020 




gatggcattc ggaggcgcaa 


agcaggaaag 


gctgaacaaa gaaattggcg atgttcgcgc 


1080 


35 tcggatttat 


cgcagccacc 


tcaactgcac 


cgttttcccg aacaacagca tgctgacctg 


1140 




ctcgggtgtt 


ttcaaagtat 


ggaacccgat 


cgacgcaaac accaccgagg tctggaccta 


1200 




cgccattgtc 


gaaaaagaca 


tgcctgagga 


tctcaagcgc cgcttggccg actctgttca 


1260 




gcgaacggtc gggcctgctg 


gcttctggga 


aagcgacgac aatgacaata tggaaacagc 


1320 




ttcgcaaaac ggcaagaaat 


atcaatcaag 


agatagtgat ctgctttcaa accttggttt 


1380 


40 


cggtgaggac gtatacggcg 


acgcggtcta 


tccaggcgtc gtcggcaaat cggcgatcgg 


1440 



86 





cgagaccagt 


tatcgtggtt 


tctaccgggc 


ttaccaggca 


cacgtcagca gctccaactg 


1500 




ggctgagttc 


gagcatgcct 


ctagtacttg 


gcatactgaa 


cttacgaaga ctactgatcg 


1560 




ctaacagacg 


agtcgaccat 


gatgatcaat 


attcaagaag 


acaagctggt ttccgcccac 


1620 




gacgccgaag 


agattcttcg 


tttcttcaat 


tgccacgact 


ctgctttgca acaagaagcc 


1680 


5 


actacgctgc 


tgacccagga 


agcgcatttg 


ttggacattc 


aggcttaccg tgcttggtta 


1740 




gagcactgcg 


tggggtcaga 


ggtgcaatat 


caggtcattt 


cacgcgaact gcgcgcagct 


1800 




tcagagcgtc 


gttataagct 


caatgaagcc 


atgaacgttt 


acaacgaaaa ttttcagcaa 


1860 




ctgaaagttc 


gagttgagca 


tcaactggat 


ccgcaaaact 


ggggcaacag cccgaagctg 


1920 




cgctttactc 


gctttatcac 


caacgtccag 


gccgcaatgg 


acgtaaatga caaagagcta 


1980 


10 


cttcacatcc 


gctccaacgt 


cattctgcac 


cgggcacgac 


gtggcaatca ggtcgatgtc 


2040 




ttctacgccg 


cccgggaaga 


taaatggaaa 


cgtggcgaag 


gtggagtacg aaaattggtc 


2100 




cagcgattcg 


tcgattaccc 


agagcgcata 


cttcagacgc 


acaatctgat ggtctttctg 


2160 




tgattcagtg 


accattttta 


caaatggtca 


ctgcaaccgc 


ggtcaccatt aatcaaaggg 


2220 




aatgtacgtg 


tatgggcaat 


caacaagtcg 


tttcgataac 


cggtg 


2265 



15 

<210> 58 
<:211> 449 
<212> PRT 

<213> Artificial Sequence 

20 

<220> 

<223> A polypeptide encoded by SEQ ID NO: 56 
<400> 58 

25 Met Asn Tyr Asn Asn Lys He Leu Val Ser Glu Ser Gly Leu Ser Gin 
15 10 15 

Lys His Leu He His Gly Asp Glu Glu Leu Phe Gin His Glu Leu Lys 

20 25 30 

Thr He Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu He 
30 35 40 45 

Pro Ala Pro Gly Asp Tyr Val Thr Ala Lys Met Gly He Asp Glu Val 

50 55 60 

He Val Ser Arg Gin Asn Asp Gly Ser He Arg Ala Phe Leu Asn Val 
65 70 75 80 

35 Cys Arg His Arg Gly Lys Thr Leu Val Ser Val Glu Ala Gly Asn Ala 
85 90 95 

Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 

100 105 110 

Glu Leu Gin Ser Val Pro Phe Glu Lys Asp Leu Tyr Gly Glu Ser Leu 
40 115 120 125 



87 

Asn Lys Lys Cys Leu Giy Leu Lys Glu Val Ala Arg Val Glu Ser Phe 

130 135 140 

His Gly Phe lie Tyr Gly Cys Phe Asp Gin Glu Ala Pro Pro Leu Met 
145 150 155 160 

5 Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro Met Phe Lys His 
165 170 175 

Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val lie Lys 

180 185 190 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His 
10 195 200 205 

Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Glu Ser lie Phe 

210 215 220 

Ser Ser Leu Ala Gly Asn Ala Ala Leu Pro Pro Glu Gly Ala Gly Leu 
225 230 235 240 

15 Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 
245 250 255 

Tyr Ser Gly Val His Ser Ala Asp Leu Val Pro Glu Leu Met Ala Phe 

260 265 270 

Gly Gly Ala Lys Gin Glu Arg Leu Asn Lys Glu He Gly Asp Val Arg 
20 275 280 285 

Ala Arg He Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn 

290 295 300 

Ser Met Leu Thr Cys Ser Gly Val Phe Lys Val Trp Asn Pro He Asp 
305 310 315 320 

25 Ala Asn Thr Thr Glu Val Trp Thr Tyr Ala He Val Glu Lys Asp Met 
325 330 335 

Pro Glu Asp Leu Lys Arg Arg Leu Ala Asp Ser Val Gin Arg Thr Trp 

340 345 350 

Gly Pro Ala Gly Phe Trp Glu Ser Asp Asp Asn Asp Asn Met Glu Thr 
30 355 360 365 

Ala Ser Gin Asn Gly Lys Lys Tyr Gin Ser Arg Asp Ser Asp Leu Leu 

370 375 380 

Ser Asn Leu Gly Phe Gly Glu Asp Val Tyr Gly Asp Ala Val Tyr Pro 
385 390 395 400 

35 Gly Val Val Gly Lys Ser Ala He Gly Glu Thr Ser Tyr Arg Gly Phe 
405 410 415 

Tyr Arg Ala Tyr Gin Ala His Val Ser Ser Ser Asn Trp Ala Glu Phe 
420 425 430 

Glu His Ala Ser Ser Thr Trp His Thr Glu Leu Thr Lys Thr Thr Asp 
40 435 440 445 



88 



Arg 



<210> 59 
5 <211> 449 

<212> PRT 

<213> Artificial Sequence 
<220> 

10 <223> A polypeptide encoded by SEQ ID NO: 57 

<400> 59 

Met Asn Tyr Asn Asn hye lie hen Val Ser Glu Ser Gly Leu Ser Gin 
15 10 15 

15 liys His Leu He His Gly Asp Glu Glu Leu Phe Gin His Glu Leu Lys 
20 25 30 

Thr He Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu He 

35 40 45 

Pro Ala Pro Gly Asp Tyr Val Thr Ala Lys Met Gly He Asp Glu Val 
20 50 55 60 

He Val Ser Arg Gin Asn Asp Gly Ser He Arg Ala Phe Leu Asn Val 
65 70 75 80 

Cys Arg His Arg Gly Lys Thr Leu Val Ser Val Glu Ala Gly Asn Ala 
85 90 95 

25 Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 
100 105 110 

Glu Leu Gin Ser Val Pro Phe Glu Lys Asp Leu Tyr Gly Glu Ser Leu 

115 120 125 

Asn Lys Lys Cys Leu Gly Leu Lys Glu Val Ala Arg Val Glu Ser Phe 
30 130 135 140 

His Gly Phe He Tyr Gly Cys Phe Asp Gin Glu Ala Pro Pro Leu Met 
145 150 155 160 

Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro Met Phe Lys His 
165 170 175 

35 Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val He Lys 
180 185 190 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His 

195 200 205 

Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Glu Ser He Phe 
40 210 215 220 



89 

Ser Ser Leu Ala Gly Asn Ala Ala Leu Pro Pro Glu Gly Ala Gly Leu 
225 230 235 240 

Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 
245 250 255 

5 Tyr Ser Gly Val His Ser Ala Asp Leu Val Pro Glu Leu Met Ala Phe 
260 265 270 

Gly Gly Ala Lys Gin Glu Arg Leu Asn Lys Glu lie Gly Asp Val Arg 

275 280 285 

Ala Arg lie Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn 
10 290 295 300 

Ser Met Leu Thr Cys Ser Gly Val Phe Lys Val Trp Asn Pro lie Asp 
305 310 315 320 

Ala Asn Thr Thr Glu Val Trp Thr Tyr Ala lie Val Glu Lys Asp Met 
325 330 335 

15 Pro Glu Asp Leu Lys Arg Arg Leu Ala Asp Ser Val Gin Arg Thr Tyr 
340 345 350 

Gly Pro Ala Gly Phe Trp Glu Ser Asp Asp Asn Asp Asn Met Glu Thr 

355 360 365 

Ala Ser Gin Asn Gly Lys Lys Tyr Gin Ser Arg Asp Ser Asp Leu Leu 
20 370 375 380 

Ser Asn Leu Gly Phe Gly Glu Asp Val Tyr Gly Asp Ala Val Tyr Pro 
385 390 395 400 

Gly Val Val Gly Lys Ser Ala lie Gly Glu Thr Ser Tyr Arg Gly Phe 
405 410 415 

25 Tyr Arg Ala Tyr Gin Ala His Val Ser Ser Ser Asn Trp Ala Glu Phe 
420 425 430 

Glu His Ala Ser Ser Thr Trp His Thr Glu Leu Thr Lys Thr Thr Asp 
435 440 445 

Arg 

30 

<210> 60 
<211> 26 
<212> DNA 
35 <213> Artificial Sequence 

<220> 

<223> An oligonucleotide 



40 



<400> 60 



gttcagcgaa cgggcgggcc tgctgg 

<210> 61 
<2li> 26 
5 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> An oligonucleotide 

10 

<400> 61 
gttcagcgaa cggccgggcc tgctgg 

<210> 62 
15 <211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

20 <223> An oligonucleotide 

<400> 62 
gttcagcgaa cgaccgggcc tgctgg 

25 <210> 63 

<211> 26 
<212> DNA 

<213> Artificial Sequence 

30 <220> 

<223> An oligonucleotide 

<400> 63 
gttcagcgaa cgatcgggcc tgctgg 

35 

<210> 64 
<211> 26 
<212> DNA 

<213> Artificial Sequence 

40 



90 

26 



26 



26 



<220> 

<223> An oligonucleotide 

<400> 64 
5 gttcagcgaa cgtgggggcc tgctgg 

<210> 65 
<211> 25 
<212> DNA 
10 <213> Artificial Sequence 

<220> 

<223> An oligonucleotide 



15 <400> 65 

ttcagcgaac gtacgggcct gctgg 



